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AN  INVESTIGATION  OF  A  CxENERAL  NORMALITY  OR 
CONTROL  FACTOR  IN   PERSONALITY  TESTING^ 

Chapter  I 
Introduction  and  Problim 


THE  present  investigation  had  its  ori- 
gin  in   several    converging  lines   of 
thought,  beginning  with  a  consideration 
of  Rosanoff's  theory  of  temperament  as 
described  briefly  in  his  well-known  man- 
ual (30),  and  related  specifically  to  the 
problem  jjresented  by  persons  obtaining 
markedly  abnormal  scores  on  the  Minne- 
sota Multiphasic  Personality  Inventory^ 
but  managing  to  stay  out  of  the  hands 
of   the   psychiatrist.   Although   it   is   ex- 
pected on  purely  statistical  grounds  that 
about    1    in   40   people   in   the   "general 
jiopulation"  will  receive  scores  more  than 
two  standard  deviations  above  the  mean 
on  any  given  scale,  still  it  is  of  interest 
to  consider  such  deviates  with  regard  to 
their  psychiatric  condition.  This  is  par- 
ticularly   true    because    some    of    these 
scores  are  so  markedly  deviant,  i.  c,  four 
or    five    standard    deviations    above    the 
"normal"  mean,  that  they  cannot  reason- 
ably be  looked  upon  as  merely  the  tail 
cases  of  a  normal  distribution  from  the 
same  supply  as  the  normals  whose  statis- 
tics define  the  probabilities  in  question. 
Furthermore,  it  is  not  possible  to  dismiss 
all  of  these  cases  as  simply  "test  misses" 
(not   in   itself  a   very   descriptive   term) 
when  closer  acquaintance  typically  shows 
them  to  be  characterized  by  rather  ab- 
normal amounts  of  the  traits  in  question 


^  This  paper  is  a  revision  of  the  thesis  suli- 
mittcd  to  the  Department  of  Psvcholosry  of  the 
University  of  Minnesota  in  partial  fulfilment  of 
the  requirements  for  the  degree  of  Doctor  of 
Philosophy,  March,  1945. 

^  Hencefortli  the  Minnesota  Multiphasic  Per- 
son;iiitv  Inventory  will  he  referred  to  for  hrevity 
as  MM  PI. 


on    clinical    grounds.    For   example,   we 
find  a  "normal"  subject  who  shows  a  de- 
pression   score    3.5    standard    deviations 
above  the  mean  of  unselected  "normals," 
who  has  no  language  handicap  or  intelli- 
gence deficit  which  would  invalidate  his 
test  results,  and  who  complains  of  feeling 
blue,    as    if    nothing   was    worth    while, 
and  of  being  filled  with  a  continual  rumi- 
native   anxiety.    Nonetheless,    we    find 
him  attending  his  classes,  achieving  scho- 
lastic   success,    making    adequate    social 
adjustments,    and    in    general    behaving 
in  such  a  way  that  no  psychiatrist  would 
consider  him  worthy  of  therapeiuic  at- 
tention unless  he  had  no  really  ill  pati- 
ents as  competitors.  On  the  other  hand, 
one  can  find  numerous  patients  in   the 
psychopathic  unit  of  the  University  Hos- 
pitals who  are  diagnosed  as  psychotic  or 
reactive  depressions   and  whose  depres- 
sion  has  left   them  effectively  incapaci- 
tated for  ordinary  social  and  economic 
interaction  to  the  extent  that  they  are  in 
an  institution  under  psychiatric  care,  yet 
with  depression  scores  a  standaid  devi- 
ation or  more  below  that  of  this  first  case 
considered.  The   immediate  problem  is 
"How  is  such  a  thing  possible?" 

Rosanoff's  theory  of  temperament, 
which  was  psychometrically  embodied  in 
the  Humm-Wadsworth  Temperament 
Scale,  may  be  briefiy  summarized  at  this 
point.  He  considers  three  major  com- 
ponents tending  toward  abnormality  as 
manifested  in  full-blown  psychiatric  con- 
ditions to  be  present  also  in  lesser  degrees 
among  "normal"  individuals.  These  com- 
ponents he   calls   the  antisocial  compo- 
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nent,  characterized  by  "imposed"  moral- 
ity and  a  tendency  to  hysteroid  and 
psychopathic  manifestations;  the  cyclo- 
tJiymic  component,  characterized  by 
mood  swings,  emotional  lability,  and  a 
tendency  to  manic-depressive  reactions; 
and  the  clinotic  sexuality  component, 
identifiable  roughly  with  the  tendency  to 
schizophrenia.  Relatively  independent  of 
these  three  abnormal  components  he 
posits  a  fourth  factor  which  is  the  "nor- 
mal," inhibiting,  or  controlling  factor  of 
temperament.  The  theory  is  that  if  one 
possesses  a  sufficiently  great  amoinit  of 
this  general  normality  or  control  factor 
in  his  make-up,  he  will  be  able  to  stay 
out  of  psychiatric  difficulties  and  will 
successfully  restrain  any  deviation  that 
may  exist  in  one  of  the  others.  He  admits 
that  the  evidence  for  this  theory  is  at 
present  sketchy  and  inadequate,  but  still 
suggestive  enough  to  warrant  its  serious 
consideration  and  investigation.  The 
lines  of  evidence  which  he  adduces  in  its 
behalf  are  ontogenetic,  pharmacologic, 
pathological,  and  data  from  the  phe- 
nomena of  senile  involution. 

The  ontogenetic  finding  which  sug- 
gests the  presence  of  a  general  inhibiting 
and  controlling  factor  of  personality  is 
the  appearance  of  antisocial,  cyclothymic, 
and  chaotic-sexual  behavior  in  children 
which  in  most  cases  is  "outgrown,"  and 
which  when  it  appears  in  children  does 
not  afford  ground  for  such  pessimism  as 
to  ultimate  outcome  as  the  same  be- 
havior does  when  appearing  in  the  "ma- 
tured" adult.  Rosanoff  suggests  that  this 
is  due  to  a  differential  rate  of  maturation 
of  the  (genetic)  factors  for  the  normal 
component  in  different  persons,  and  that 
the  appearance  of  such  "abnormal" 
trends  in  children  who  turn  out  to  be 
normal  in  adult  life  is  due  to  the  greater 
slowness  of  maturation  of  this  inhibiting, 


controlling  factor  of  temperament.  Those 
in  Avhom  the  genetic  basis  for  such  a 
factor  is  basically  weaker  never  grow  out 
of  the  abnormal  manifestations  and  as 
adults  have  a  poor  prognosis. 

In  the  pharmacologic  realm,  Rosanoff 
stresses  as  especially  significant  the  wide 
individual  dilferences  in  reactivity  to 
alcohol.  In  persons  who  are  normally 
controlled  and  inhibited,  the  effect  of 
paralyzing  the  higher  nervous  centers 
through  alcohol  is  to  release  very  differ- 
ent kinds  of  behavior.  In  some  persons, 
chaotic  sexuality  makes  its  appearance, 
e.  g.,  the  isolated  appearance  of  homo- 
sexual manifestations  in  superficially 
"normal"  males  under  the  influence  of 
alcohol.  Others  show  antisocial,  still 
others  cyclothymic  behavior.  The  hypoth- 
esis is  that  in  the  normal  condition 
these  components  are  already  present  in 
all  persons  (although  in  varying  degrees), 
and  that  the  direction  of  the  uninhibited 
behavior  is  a  function  of  the  relative 
strength  of  the  ordinarily  controlled  and 
inhibited  tendencies  in  either  of  these 
three  directions. 

The  argument  in  the  case  of  data  from 
organic  pathology  and  senile  involution 
is  practically  identical.  In  the  early 
stages  of  diffuse  cerebral  lesions,  before 
the  general  mental  deterioration  of  ad- 
vanced cerebral  pathology  has  super- 
vened, phenomena  of  a  schizophrenic, 
cyclothymic,  and  antisocial  nature  occur. 
It  is  common  knowledge  in  psychiatry 
that  although  certain  kinds  of  response 
may  be  expected  more  often  than  others, 
depending  upon  the  nature  of  the 
pathology,  nevertheless  there  is  consider- 
able variation  in  the  behavior  syndrome 
which  occurs  when  the  central  nervous 
system  is  ravaged  by  organic  disease.  For 
example,  two  persons  have  been  ade- 
quately controlled  and  socially  adapted 


Mi. 


A    GENERAL    NORMALITY    OF    CONTROL    FACTOR    L\    PERSONALITY    TESTLNG 


prior  to  becoming  ill.  Botli  individuals 
become  paretic  in  the  late  stages  of 
syphilis,  and  one  shows  euphoria,  ex- 
pansiveness,  and  many  of  the  symptoms 
of  an  atypical  affective  psychosis.  The 
other  may  instead  become  irascible,  para- 
noid, and  hallucinated.  "Here  again, 
these  observations  suggest  that  there  ex- 
ists in  the  cerebrum  an  inhibiting  and 
controlling  ajjparatus;  that  the  functions 
of  this  apparatus  are  among  the  most 
vulnerable  of  all  the  cerebral  functions 
and  are  among  the  earliest  to  be  lost  in 
cases  of  diffuse  cerebral  lesions;  and  that 
the  particular  manifestations  that  appear 
in  a  given  case,  upon  the  loss  of  the 
inhibiting  and  controlling  functions,  de- 
pend on  the  particular  temperamental 
components  that  have  already  existed  in 
the  individual,  though  in  a  state  of  com- 
plete or  partial  latency."  (30,  pp.  667- 
668.) 

Although  this  conception  of  tempera- 
ment is  not  widely  accepted  as  such, 
anvone  who  works  with  psychiatrists 
hears  them  frequently  classifying  patients 
into  categories  that  are  at  least  closely 
similar  in  their  connotation  to  Rosanoff's 
control  factor.  Thus  two  patients  with  a 
similar  picture  of  monosymptomatic  con- 
version hysteria  may  be  sharply  separated 
on  the  grounds  that  one  has  more  in- 
tegration, is  better  controlled,  "keeps 
her  neurosis  in  hand"  to  a  greater  extent, 
and  similar  characterizations. 

The  problem  as  originally  posed  could 
be  stated  something  like  this:  Is  there  a 
fairly  stable  trait  (variable,  characteristic) 
of  persons,  with  regard  to  which  there 
exist  wide  individual  differences,  that  is 
relatively  uncorrelated  with  abnormal 
components  of  personality  but  which,  if 
present,  lessens  the  likelihood  that  a 
large  amount  of  some  abnormal  compon- 
ent will  result  in  psychiatric  breakdown? 


If  there  is,  how  can  it  be  measured?  If  it 
can  be  measured,  what  correlates  does  it 
have,  what  is  its  behavioral  dynamic  na- 
ture, and  what  are  the  people  like  who 
have  a  lot  or  a  little  of  it? 

The  theoretical  importance  of  such  a 
factor  is  too  obvious  to  require  discus- 
sion. On  the  other  hand,  even  if  its 
psychological  nature  were  not  elucidated 
satisfactorily,  the  practical  importance 
of  having  a  means  of  measuring  it  would 
be  considerable.  It  would  constitute  an- 
other technique  for  reducing  the  errors 
of  prediction  and  classification  using  a 
psychometric  instrument  such  as  MMPI. 
It  is  also  conceivable  that  it  would  have 
prognostic  and  therapeutic  significance, 
for  example,  in  estimating  the  probabili- 
ties of  staying  well  after  discharge  follow- 
ing psychotherapy.  In  the  industrial  and 
other  situations  in  which  inspection  of  a 
test  profile  is  expected  to  yield  a  predic- 
tion of  successful  adjustment  in  the 
prospective  situation,  any  score  which 
would  differentiate  those  persons  who 
have  deviant  tendencies  but  can  be  rea- 
sonably expected  to  remain  "adjusted" 
by  some  minimal  standard  from  those 
who  with  scores  no  more  deviant  can  be 
expected  to  prove  inadequate,  would  be 
of  tremendous  advantage.  For  these 
several  reasons  both  theoretical  and  prag- 
matic the  development  of  such  a  scale 
(either  of  a  very  general  nature  or  more 
specifically  in  respect  to  some  particular 
test)  would  be  of  considerable  value. 

A  survey  of  the  literature  fails  to  reveal 
any  empirical  investigations  of  real 
relevance  and  merit,  with  almost  no  ex- 
ceptions. A  careful  study  of  articles  in  the 
Psychological  Abstracts  under  the  head- 
ings abnormality,  adjustment,  character? 
constitution,  control,  delinquency,  frus- 
tration, inhibition,  neuroticism,  normal- 
ity,  personality,   persistence,   psychoneu- 
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rosis,  psychopathy,  rigidity,  stability, 
stress,  temperament,  volition,  and  -will 
yields  practically  no  material  devoted  to 
this  aspect  of  personality  testing,  al- 
though there  appear  a  number  of  discus- 
sions regarding  such  a  hyi)othetical  fac- 
tor of  personality,  mider  \arious  guises 
and  names.  Of  greatest  (and  in  fact  the 
only  obvious)  relevance  is  the  work  of 
Hiunm  and  ^Vads^vorth,  constructing  a 
temperament  scale  with  Rosanoff's  theory 
in  mind  (17.  18.  19,  20,  21,  22).  Of  slight 
relevance  one  may  mention  studies  of 
"persistence"  (15),  the  theory  of  "frus- 
tration tolerance,"  (31,  32,  33),  and  Free- 
man's work  on  a  "psychological  plimsoU 
mark"  (8).  Freeman's  is  the  only  one  of 
the  last  three  which  uses  a  question- 
answer  test  to  measure  the  hypothetical 
factor  involved,  the  test  in  this  case  being 
the  "normal"  component  of  the  Humm- 
^Vads^\•orth  scale. 

This  last  scale  consists  of  38  items 
selected  empirically  by  the  authors  on 
the  basis  of  overall  differentiation  be- 
tween a  group  of  so-called  "normal"  per- 
sons who  were  defined  so  by  being 
company  employees  adjusting  satisfac- 
torily to  their  work  and  showing  no 
evidence  of  abnormal  trends  in  their  case 
studies,  and  a  heterogeneous  group  of 
patients  committed  to  state  hospitals  as 
the  "abnormals." 

In  the  application  of  the  Humm- 
"Wadsworth  scale  to  clinical  diagnosis,  the 
assumption  was  essentially  based  on  the 
theory  of  Rosanoff,  in  that  a  subject  with 
a  deviant  score  on  a  given  scale,  e.  g., 
schizoid-autistic,  would  not  be  considered 
as  probably  maladjusted  to  a  serious  de- 
gree if  his  "normal"  component  were 
also  high,  since  the  latter  would  act  as  an 
inhibitor  or  "brake"  on  the  appearance 
of  his  abnormal  tendency.  Whether  one 
considers    this    to   have   worked   or   not 


depends  upon  the  evaluation  he  places 
on  the  Hunmi-\Vadsworth,  concerning 
which  the  evidence  is  somewhat  conflict- 
ing (2,  5.  6,  18,  19,  22,  23).  At  any  rate 
the  problem  of  whether  the  scale  called 
"normal"  measures  the  hypothetical  nor- 
mality factor  of  Rosanoff's  theory  is  not 
solved  this  easily,  since  such  a  set  of 
items  might  serve  the  purely  psycho- 
metric fiuiction  required  by  the  theory 
but  on  an  entirely  different  basis. 

The  items  on  the  Humm-Wadsworth 
scale  are  not  psychologically  homogene- 
ous in  any  obvious  way.  However,  for 
the  vast  majority  of  these  items  two 
things  at  least  can  be  said.  Firstly,  they 
are  scored  for  "normality"  when  an- 
swered in  the  direction  that  a  naive 
person  would  consider  more  normal  and 
adjusted  by  inspecting  the  item  content, 
though  this  fact  means  less  than  many 
persons  might  think  when  one  considers 
what  is  found  in  purely  empirical  item 
analyses  such  as  done  by  Humm  and 
Wadsworth  or  by  the  authors  of  the  Min- 
nesota Multiphasic  Personality  Inven- 
tory. Humm  and  Wadsworth  do  not  give 
the  statistically  preferred  response  for  the 
general  population,  so  it  is  impossible 
to  say  whether  the  response  called  "nor- 
mal" is  that  of  the  majority  of  normal 
persons.  Nevertheless  the  method  used 
by  these  authors  in  item  selection  was  in 
this  case  the  differentiation  of  their 
heterogeneous  group  of  "abnormals"  (de- 
fined by  being  institution  inmates)  from 
a  group  of  persons  rated  as  normal  and 
making  satisfactory  vocational  adjust- 
ment (17,  p.  167).  So  that  whether  the 
item  response  scored  "normal"  on  the 
Humm-Wadsworth  is  the  response  given 
by  the  majority  of  normal  persons  or 
not,  at  least  it  is  known  to  be  the  re- 
sponse given  significantly  more  often  by 
normal  persons  than  by  abnormal  per- 
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sons  considered  as  a  class.  Many  of  the 
items  on  the  Humm-\Vadsworth  normal 
scale  are  found,  in  slightly  altered  form, 
in  the  MMPI.  Inspection  of  the  scoring 
of  these  items  on  MMPI  shows  that  88 
l^er  cent  of  the  Hwmm-\Vadsworth  "nor- 
mal" items  whi(h  appear  in  almost 
identical  form  in  MMPI  are  responded 
to  by  the  majority  of  the  MMPI  general 
population  sample  in  the  same  direction 
that  is  scored  "normal"  on  the  Humm- 
Wadsworth.  This  fact,  combined  with 
the  item  content,  suggests  rather  strongly 
that  to  achieve  a  high  "normal"  score  on 
the  Humm-Wadsworth  it  is  necessary 
to  avoid  saying  psychiatrically  "bad" 
things  about  yourself  of  a  sort  which  are 
so  general  among  all  varieties  of  abnor- 
mals  that  they  show  a  differentiation 
when  this  latter  group  is  considered  with- 
out fmther  breakdow^n.  Examples  of  the 
H-AV  "normal"  items  are  such  responses 
as  the  following: 

Have  your  hardest  battles  been  with  your- 
self? (Yes) 

Has  more  than  one  person  called  you 
hotheaded?  (No) 

Have  you  lost  out  in  several  undertakings 
by  not  making  up  yoiu-  mind  quickly 
enough?  (No) 

Have  you  several  times  worked  under 
people  who  seemed  to  have  things  fixed  so 
that  they  would  get  the  credit  for  good  work, 
and  those  under  them  would  be  blamed  for 
mistakes?  (No) 

Have  you  at  times  felt  obliged  to  ask  your 
friends  to  help  you  personally  even  though 
you  could  not  return   the  favor?  (No) 

Do  you  think  most  people  inwardly  dislike 
putting  themselves  out  to  help  other  people? 
(No) 

Is  it  always  best  to  keep  your  mouth  shut 
when  you  are  in   trouble?  (No) 

Have  you  ever  taken  up  a  line  of  work 
which  called  for  close  attention  to  fine  de- 
tail, to  the  exclusion  of  other  activities?  (No) 

Have  you  ever  stayed  away  from  another 
person  because  you  feared  doing  or  saying 
something  you  might  afterwards  regret?  (No) 


Have  you  ever  felt  that  difficulties  were 
piling  up  to  such  an  extent  that  you  could 
not  overcome  them?  (No) 

Have  you  at  times  been  certain  that  people 
were  talking  about  you  although  you  couldn't 
prove  it?  (No) 

Although  there  are,  as  always  in  a  test 
of  this  sort,  some  responses  of  a  puzzling 
and  seemingly  irrational  character  scored 
in  a  certain  way,  this  list  of  items  (every 
third  item  in  the  H-W  normal  scale) 
indicates,  I  think,  the  general  tendency 
for  the  "normal"  scored  response  to  con- 
sist of  a  denial  of  a  trait  or  symptom 
which  if  present  would  indicate  malad- 
justment. This  observation  may  seem 
utterly  trivial  but  the  importance  of  it 
will  become  clear  when  the  interpreta- 
tion of  the  results  of  the  present  study  is 
discussed. 

A  further  relevant  finding  is  that  the 
so-called  "normal"  component  shows 
substantial  negative  correlations  with 
several  of  the  other  components  as  found 
by  three  investigators  (2,  6,  23),  although 
for  some  unaccountable  reason  the  co- 
efficients between  the  normal  component 
and  other  components  are  reported  as 
positive  by  Humm  and  Wadsworth.  The 
highest  negative  correlations  (ranging 
from  —.53  to  —.78)  reported  by  these 
other  three  investigators  are  with  the 
depression  and  paranoid  scales.  The  cor- 
relation of  the  "no-count,"  as  would  be 
expected  from  the  list  of  items  in  the 
sample  above,  is  markedly  positive  with 
"normal." 

It  is  not  here  denied  that  the  H-\V 
"normal"  component  is  measuring  the 
hypothetical  inhibiting,  controlling 
"brake"  factor  of  Rosanoff's  theory.  But 
it  is  obvious  that  the  differentiation  of 
heterogeneous  abnormals  from  normal, 
adjusted  persons  by  a  given  item  does 
not  establish  prima  facie  that  it  is  meas- 
uring such  a  component.  The  scale,  the 


G 


PAUL   E.    MEEHL 


internal  consistency  of  which  is  indicated 
by    Himim    and    ^Vadsworth's   split-half 
reliability  coefficient  of  .82   (17),  might 
^veU   be   measuring  some   other   compo- 
nent of  personality  than  the  one  hypoth- 
esized   by    Rosanoff   and    still    function 
statistically  in  the  way  suggested  by  tlic 
authors.  To  mention  only  two  possibili- 
ties, it  might  be  measuring  the  overall 
freedom  from  reality-distortion  that  ap- 
pears in  verbal  behavior,  so  that  a  gen- 
eral collection  of  abnormals  woidd  score 
Unv.  Or  again,  it  might  mcasiu'e  (as  will 
be  suggested  on  more  evidence  below) 
some  "test-taking"  trait  of  such  a  nature 
that  ^\•hile  not  itself  intimately  related 
to    a    specific    psychiatric    syndrome,    it 
would  be  related  to  the  tendency  of  per- 
sons,  whether  normal   or   abnormal,   to 
ans^\•er  items  in  a  certain  way.  In  which 
case    a    high    "normal"    component    oc- 
curring in  a  person  who  was  also  high 
on  some  pathological  component  might 
operate  to  suggest  improved  chances  of 
his  being  clinically  normal  in  the  same 
way   the   "no-count"   does   the  converse. 
I  do  not  wish  to  defend  any  particular 
hypothesis  regarding  the  nature  of  the 
underlying  personality  trait  sampled  by 
the  HAV  "normal"  items,  but  it  must  be 
vniderstood  for  what  follows  that  the  dif- 
ferentiation of  normals  and  abnormals 
by  a  given  item  is,  as  it  stands,  a  rather 
ambiguous  fact.   So  far  as  I   know,   the 
only  empirical   finding  which  tends  in- 
dependently   to    support    the    Rosanoff 
theory  as  an  interpretation  of  the  psy- 
chological meaning  of  the  H-W  "normal" 
component  is  the  study  of  Freeman  (8) 


mentioned  above. 

In  the  present  investigation  the  initial 
attack  on  the  problem  was  made  in  terms 
of   one    particular   personality    test,    the 
Minnesota    Mu]tii)liasic    Personality    In- 
ventory, which  has  been  described  in  de- 
tail elsewhere  (9,  10.  11,  12,  13,  14,  24). 
In  brief  sunnnary,  this  is  an  empirically 
constructed  set  of  scales  furnishing  scores 
on  nine  personality  variables  named  as 
follows:  Hs  (hypochondriasis),  D  (depres- 
sion),   Hy    (hysteria),    Pd    (psychopathic 
deviate),  Mf  (masculinity-femininity),  Pa 
(paranoia),  Pt  (psychasthenia),  Sc  (schizo- 
]ihrcnia),  and  Ma  (hypomania),  together 
w'ith  three  scales  used  to  determine  the 
jirobable    trustworthiness    of    the    other 
nine.  These  last  three  are:  ?  (number  of 
items  answered  "cannot  say"  and  thus  ef- 
fectively   removed    from    all    scales    on 
which  they  appear),  L  (for  "lie,"  meas- 
uring the  deliberate  or  unconscious  tend- 
ency of  the  sorter  to  put  himself  in  a 
favorable  light)  and  F  (tendency  to  give 
statistically    infrequent    replies,    usually 
suggesting  lack  of  understanding  or  hap- 
hazard   sorting).    Before    beginning    the 
empirical  investigation  it  was  felt  desir- 
able to  formulate  as  well  as  could  be 
done   a   priori    the   logically   exhaustive 
alternative  ways  in  which  a  person  could 
come  out  with  a  deviant  score  on   the 
Multiphasic  on  some  scale  and  yet  avoid 
being  in  a  psychiatric  hospital  nonethe- 
less.   This   requires   a   preliminary   con- 
sideration of  the  theory  of  the  trait  em- 
ployed in  work  with  this  instrument,  and 
will  be  dealt  with  at  some  length  in  the 
next  chapter. 


Chapter  II 
Theoretical  Considerations  Regarding  MM  PI 

IN  TERMS  of  Allport's  distinction  be-  symptomatology.  We  know  that  if  the 
tween  a  "biosocial"  and  a  "biophysi-  patl)ological  process  is  to  be  the  real 
cal"  concept  of  a  trait,  it  can  be  said  criterion  which  specifies  an  entity,  these 
without  hesitation  that  the  attitude  prev-  persons  were  misclassified.  If  the  intent 
alent  among  those  who  work  clinically  of  the  diagnosers  of  1850  was  to  put 
with  the  MM  PI  is  clearly  biophysical.  those  persons  together  who  belonged  to- 
Although  the  diagnosis  of  a  psychosis  gether  because  they  had  the  same  "dis- 
must  necessarily  be  made  at  the  time  of  ease,"  on  these  non-luetic  persons  they 
decision  in  terms  of  the  "social  stimulus  are  now  known  to  have  been  wrong.  It 
value"  of  the  patient  to  the  psychiatric  is  true  that  as  long  as  their  entity  was 
staff,  nevertheless  this  social  stimulus  only  defined  by  the  symptom  syndrome, 
value  is  not  the  intended  content  of  the  they  cannot  be  said  to  have  been 
diagnosis.  The  best  testimony  to  this  "wrong"  in  a  semantic  sense;  but  inas- 
fact  is  the  very  frequent  occurrence  at  a  much  as  the  cases  were  etiologically  and 
case  conference  of  statements  such  as  pathologically  heterogeneous,  these  for- 
"He  looks  to  me  like  a  schizophrenic  but  mer  investigators  would  now  be  willing 
as  you  say,  he  may  Jiot  he."  It  is  regret-  to  concede  that  all  of  the  cases  should 
tably  true  in  the  field  of  psychiatry  that  not  have  been  classified  together,  al- 
precisely  w^hat  it  means  to  he  a  schiz-  though  at  the  time  they  had  no  way  of 
ophrenic  is  by  no  means  clear.  For  those  knowing  which  ones, 
who  are  compulsively  operational  this  is  Another  analogy  can  be  found  in  the 
a  frustrating  state  of  affairs,  but  I  do  not  case  of  units  of  measurement  in  electric- 
see  any  advantage  in  the  approach  which  ity.  Some  writers  may  take  the  coulomb 
says  "We  do  not  yet  know  the  internal  as  the  basic  unit,  defined  in  terms  of  the 
state  which  is  a  necessary  and  sufficient  deposition  of  silver  at  the  cathode  of  a 
condition  for  belonging  to  the  class  of  silver  salt  solution.  In  terms  of  this  the 
schizophrenics,  which  we  would  better  current  can  be  defined  as  time  rate  of 
define  if  we  could;  therefore  schizophren-  passage  of  amount,  the  resistance  in 
ics  are  all  those  labeled  as  such  in  a  terms  of  current  for  a  given  EMF,  and 
psychiatric  diagnosis."  In  some  respects  so  on.  The  manifestations  of  the  passage 
the  situation  is  very  similar  to  that  which  of  an  electric  current  may  be  considered 
existed  in  the  case  of  a  disease-entity  from  the  standpoint  of  such  diverse 
such  as  "general  paralysis  of  the  insane"  operations  as  the  heat  generated  in  a 
before  Noguchi  and  Moore  found  the  resistor,  the  amount  of  deflection  of  a 
spirochete  in  the  brains  of  paretics.  If  compass  needle,  the  rate  of  deposition  of 
we  could  now  go  back  and  get  ^\^asser-  silver,  the  intensity  of  illumination  of  a 
mann,  gold  colloid,  etc.,  tests  on  all  in-  bulb,  and  the  like.  It  is  fortunate  for  the 
dividuals  diagnosed  G.P.I,  from  1850  to  physicist  that  the  dependence  of  these 
1900,  we  should  find  a  number  who  have  different  phenomena  on  the  underlying 
nothing  in  common  with  the  majority  "real"  process  is  sufficiently  close  and 
so  classified  except  certain  similarities  of  straight-forward  so  that  he  does  not  have 
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a  serious  problem  of  "choosing"  one  of 
them  as  his  definition  of  the  amount  of 
current,  in  order  to  be  able  to  give  an 
immediate  ans^\•er  to  the  tjuestion  "Hoav 
much  currcni"  in  those  cases  in  ^vhich 
the  rcsidts  of  these  diverse  operations 
and  observations  failed  to  agree.  Suppose 
now  that  a  Maxwell's  demon  exists  in 
the  siher  solution  and  insists  on  con- 
founding us  bv  arbitrarily  stopping  and 
starting  ions  in  their  passage  to  the 
cathode.  Another  demon  exerts  \ariablc 
amoiuits  of  force  on  the  needle  of  a  gal- 
vanometer. Still  another  interferes  in 
some  way  with  the  measurement  of  il- 
lumination from  oin-  bidb,  and  so  on 
through  all  the  operations  defining 
"amoinit  of  current."  The  result  would 
be  a  failure  of  these  diverse  procedures 
to  yield  values  of  the  "current"  that 
^\•ould  be  isomorphic  with  the  imderly- 
ing  reality  (flow  of  electrons),  and  hence 
also  they  would  fail  to  be  in  good  cor- 
respondence with  one  another. 

Now  the  point  I  wish  to  make  is  that 
it  is  still  not  necessarily  the  best  scien- 
tific procedure  for  the  confused  physicist 
to  say  "W^ell,  there  doesn't  seem  to  be 
any  decent  covariation  in  these  phenom- 
ena, so  I  guess  there  isn't  any  under- 
Ivins:  common  class  of  events  involved. 
Operationally  speaking,  I  have  to  define 
what  I  mean  bv  flo^v  of  current  unam- 
biguously,  so  I'll  pick  galvanometer  de- 
flection. In  the  future,  whatever  else  is 
observed,  whenever  the  galvanometer  is 
deflected  so  much,  I  shall  define  that  as 
the  flow  of  so  much  current.  Any  dis- 
crepancies with  other  facts  are  just  un- 
fortunate, that's  all."  This  approach  is 
positivistically  elegant  but  in  my  opin- 
ion not  calculated  to  bring  about  the 
most  fruitful  scientific  thinking,  particu- 
larly in  a  field  which  is  in  an  embryonic 
state  and  in  which  the  best  criteria  for 


inclusion  of  a  definition  or  concept  are 
not  even  yet  known  except  dogmatically. 
Such  covariation  as  does  remain  after  the 
demons  have  done  their  work  might  be 
fruitfully  lUilized  in  an  effort  to  get 
tentative  interpretations  including  inter- 
pretations of  the  failures  of  correlation, 
and  to  imjiro\e  the  apparatus  so  as  to 
V.cQ\)  the  demons  oiu.  If  the  latter  can 
be  managed,  the  "true"  imderlying  flow 
of  electrons  will  finally  be  arrived  at. 

To  take  a  typical  example  in  the 
clinical  use  of  the  Multiphasic,  one 
might  consider  the  case  of  hysteria.  It 
has  long  been  known  by  psychiatrists 
that  aside  from  the  presentation  of  con- 
version symptoms  which  define  the  clini- 
cal entity  named  hysteria,  there  is  a  more 
or  less  well  defined  personality  structure 
characteristic  of  the  persons  so  diag- 
nosed. At  the  more  superficial  levels  one 
speaks  of  the  tendency  of  the  hysteric 
to  shut  oiu  unpleasant  facts  from  his 
conscious  thinking;  the  "emotional  im- 
maturity" showing  up  by  parasitism, 
sexual  frigidity,  overdependence  upon 
parents,  selfishness  in  love  relationships; 
a  certain  beautiful  indifference  to  the 
symptoms;  Rosenzweig's  "impunitive- 
ness"  as  a  reaction  to  frustration;  and  so 
on.  A  psychoanalyst  goes  on  to  char- 
acterize even  deeper  layers  of  the  per- 
sonality— phallic  genital  level,  etc.  "Hys- 
teria" diagnostically  refers  to  the  pro- 
duction of  certain  disease-mimicking 
symptoms,  yet  a  "hysteroid"  tempera- 
ment is  discernible  and  is  in  fact  utilized 
in  diagnosis,  e.g.,  as  between  a  diagnosis 
of  hysteria  and  hypochondriasis. 

This  clinical  finding  is  supported  by 
the  empirical  findings  of  test  item  anal- 
ysis arrived  at  in  studying  the  responses 
to  the  MMPI  of  persons  so  diagnosed. 
There  is  in  the  set  of  items  of  the  hysteria 
scale  a  group  referring  to  somatic  com- 
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plaints,  the  core  trait  of  the  clinical 
entity  as  the  medical  man  perceives  it. 
But  there  is  also  a  set  of  items  reflecting 
a  tendency  to  deny  psychiatric  and  per- 
sonality problems,  typified  by  responses 
such  as  saying  "false"  to  such  items  as 
"I  frequently  have  to  fight  against  show- 
ing that  I  am  bashful,"  "I  get  mad  easily 
and  then  get  over  it  soon,"  "Often  I 
cannot  imderstand  why  I  have  been  so 
cross  and  grouchy,"  and  so  on.  The  fact 
that  this  set  while  self-correlating  is 
slightly  negative  in  correlation  with  the 
somatic  set  does  not  prevent  the  sum 
of  the  two  sets  from  differentiating  hys- 
terical persons  better  than  either  one 
could  do  it  alone.  Although  persons  in 
general  tend  to  answer  these  items  in  a 
negatively  correlated  way,  yet  a  person 
with  a  clinical  hysteria  has  a  tendency  to 
reverse  this  negative  correlation  and  be 
a  deviate  in  the  same  direction  with  re- 
gard to  both.  It  would  seem  tentatively 
plausible  that  the  "hysterical"  person  is 
characterized  by  his  deviation  on  several 
unrelated  (or  in  this  case  actually  nega- 
tively related)  components  of  personal- 
ity. How  those  components  had  best  be 
defined  can  only  be  found  by  an  empiri- 
cal investigation  of  their  clinical  cor- 
relates, the  way  they  hang  together,  and 
how  to  give  them  the  most  psychologi- 
cal meaning. 

For  the  present  purpose  it  will  be  con- 
venient to  make  a  clear  distinction, 
therefore,  betw^een  the  operationally 
measured  score  achieved  on  a  scale  with 
a  certain  name  on  the  one  hand,  and  the 
hypothetical  amount  of  the  trait  pos- 
sessed on  the  other.  In  what  follows  if  the 
biophysically  intended  trait  is  discussed 
it  will  be  referred  to  by  its  name,  where- 
as the  score  value  will  be  indicated  by 
the  initial  letters  of  the  scale  as  is  com- 
mon in  present  discussion  of  the  MMPI 


at  this  institution.  Thus,  in  the  case  of 
hysteria  we  have  first,  "hysteria,"  the 
clinical  entity  showing  conversion  symp- 
toms and  the  like;  secondly,  we  have 
hypothesized  the  "hysteroid  component" 
(or  components)  which  predispose  to 
hysteria  but  are  not  identical  with  the 
appearance  of  symptoms;  and  thirdly, 
we  have  "Hy"  which  is  the  obtained 
Multiphasic  score.  Allport's  considera- 
tion of  the  "individual"  trait  as  mean- 
ing that  every  person's  hysteroid  com- 
ponent is  unique  and  cannot  truly  be  ar- 
ranged on  any  such  continuum  will  be 
theoretically  conceded  but  can  obviously 
not  be  treated  of  in  measurement  pro- 
cedures of  the  present  nomothetic  sort. 
The  construction  of  the  MMPI  has 
been  detailed  elsewhere  and  will  onlv  be 
summarily  treated  at  this  time.  The  pro- 
cedure has  been  with  trivial  exceptions 
a  thoroughly  empirical  one  which  im- 
plicitly denies  the  usual  assumption  that 
one  can  deduce  the  verbal  behavior  of  a 
certain  sort  of  person  in  a  test-taking 
situation  from  a  priori  considerations 
alone.  Allied  with  this  denial  is  the  at- 
titude that  the  verbal  type  of  personal- 
ity inventory  is  not  most  fruitfully  seen 
as  a  "self-rating"  or  self-description 
whose  value  requires  the  assumption  of 
accuracy  on  the  part  of  the  testee  in  his 
observations  of  self  (26).  Rather  is  the 
response  to  a  test  item  taken  as  an  in- 
trinsically interesting  segment  of  verbal 
behavior  knowledge  regarding  which 
may  be  of  more  value  than  any  knowl- 
edge of  the  "factual"  material  about 
which  the  item  superficially  purports  to 
inquire.  Thus  if  a  hypochondriac  says 
that  he  has  "many  headaches"  the  fact 
of  interest  is  that  he  says  this,  and  that 
fact  would  not  be  controverted  by  any 
investigations  establishing  that  he  has 
interpreted  the  word  "many"  in  a  statis- 
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tically  rare  fashion  and  actually  does  not 
suffer  headaches  any  more  frequently 
than  the  ordinary  person.  Ihc  now 
prevalent  pessimism  with  traditional 
question-answer  personality  tests  because 
they  all  require  the  subject  to  be  both 
honest  and  objective  in  his  self-descrip- 
tions simply  does  not  apply  because  it  is 
what  he  says  of  himself  that  is  diagnostic 
rather  than  what  is  in  fact  the  case. 

With  this  orientation  the  procedure 
of  construction  of  the  MMPI  follows  in 
a  straightforward  fashion.  Consider  any 
homogeneous  clinical  group  for  the  de- 
tection of  which  a  scale  is  to  be  con- 
structed. Then  the  items  on  such  a  scale 
are  selected  by  the  empirical  determina- 
tion of  response-frequencies  for  this 
group  as  compared  Avith  those  for  per- 
sons in  general.  It  may  be  the  case  that 
other  groups  than  the  one  intended  also 
deviate  in  their  responses  to  certain 
items.  This  is  again  determined  em- 
pirically and  such  items  are  either  elimi- 
nated or  a  new  comparison  of  the  gioup 
desired  with  the  erroneously  discrimi- 
nated group  is  made  to  create  a  "correc- 
tion scale"  (lo,  ii).  By  this  technique 
there  have  up  to  the  present  writing 
been  constructed  the  nine  scales  men- 
tioned. 

Before  embarking  on  the  experimental 
material  in  the  present  investigation  it 
is  of  interest  to  formulate  the  possibili- 
ties of  a  "test  miss"  in  so  far  as  they  can 
be  seen  a  priori.  Granted  the  two  simple 
and  well  established  facts  that  (i)  the 
scales  tend  to  identify  the  people  for 
whom  they  were  named,  (2)  they  do  not 
by  any  means  do  so  infallibly,  the  ques- 
tion arises  "Why  does  it  miss  when  it 
does?"  I  have  tried  to  set  down  what 
seems  to  me  an  exhaustive  list  of  the 
ways  in  which  this  could  theoretically 
occur,  keeping  the  above  distinction  be- 


tween a  scale  value  and  the  biophysical 
trait  value  in  mind.  Some  of  these,  es- 
pecially I-B,  arc  not  always  sources  of 
error,  as  shoidd  be  clear  from  the  preced- 
ing discussions,  but  lead  to  errors  of 
certain  kinds  in  certain  cases. 

I.  "Errors  of  incasincnicnt"  as  regards  the 
single  trait  score  itself.  This  includes  all 
those  cases  in  which  the  personality  vari- 
able underlying  the  scale  responses  does 
not  actually  exist  in  the  amount  indi- 
cated by  the  score. 

A.  The  answers  given  do  not  correspond 
to  the  facts  as  the  patient  himself 
sees  them. 

1.  Inadequate  understanding  of  the 
questions  due  to  feeblemindedness, 
organic  brain  disease,  reading  diffi- 
culties,  bilingualism,    etc. 

2.  Failure  to  cooperate  in  the  task  re- 
quired. 

a.  Deliberate  misrepresentation— 
"lying." 

b.  Responding  more  or  less  at  ran- 
dom due  to  boredom,  hurry, 
fear  of  results. 

3.  Unusual  interpretation  of  the 
meaning  of  the  questions  of  a  sort 
other  than  those  interpretations  re- 
lated to  the  personality  trait  it  is 
desired  to  detect. 

B.  The  patient  does  not  see  the  facts  as 
they  are. 

II.  The  testee  actually  possesses  the  under- 
lying trait  to  the  extent  indicated  by  the 
score,  but  he  fails  to  manifest  it  symp- 
tomatically  and  thus  appears  as  a  "test 
miss." 

A.  An  external  situation  or  a  non  per- 
sonality trait  exists  in  his  life  which  is 
especially  favorable  for  adjustment 
and  thus  he  remains  free  of  symptoms. 
Family  protection,  sheltering,  financial 
inheritance,  Murray's  "gratuities"  such 
as  beauty,  high  intelligence,  etc. 

B.  Other  compensating  personality  traits 
exist. 

1.  Other  traits  which  can  act  to  in- 
hibit the  one  in  question.  For  in- 
stance, a  potential  psychopath  is 
kept  out  of  trouble  by  a  concomit- 


A    GENERAL    NORMALITY    OF    CONTROL    FACTOR    L\    PERSONALITY    TESTING      1  I 


ant  psychasthenia   that  makes  him 
be  careful. 
2.  The  hypothetical  "general  control" 
factor  posited  by  Rosanoff. 

It  is  obvious  that  some  of  the  above 
sources  of  error  arc  minimized  by  tech- 
niques included  in  the  MMPI  itself.  For 
example,  the  case  of  deliberate  misrep- 
resentation is  to  some  extent  at  least 
overcome  by  the  inclusion  of  the  Lie 
scale  and  its  more  recent  experimental 
forms.  A  marked  lack  of  understanding 
such  as  that  which  might  occur  in  an 
almost  illiterate  or  foreign-speaking  stib- 
ject  will  almost  inevitably  raise  the  F- 
score  and  thus  make  the  interpreter  sus- 
picious. It  should  be  clearly  understood 
that  the  use  of  categories  such  as  "un- 
usual interpretation"  and  "not  seeing 
the  facts  as  they  are"  does  not  contradict 
the  empiristic  verbal-sample  attitude 
previously  stated. 

Some  varieties  of  unusual  interpreta- 
tion, as  in  the  case  of  frequency  of  head- 
aches, are  correlates  of  the  trait  in  which 
we  happen  to  be  interested,  and  the  un- 
usual interpretation  is  actually  a  basis 
for  our  discrimination.  On  the  other 
hand,  there  may  be  interpretive  differ- 
ences among  persons  which  are  corre- 
lated with  entirely  different  behavior 
variables  including  those  which  have  no 
appreciable  personality  relevance.  For 
instance,  if  one  asks  the  question  "How 
many  are  several  hooks"  he  gets  a  con- 
siderable variation  in  the  connotation 
people  put  upon  this  quite  neutral,  non- 
personological  quantifier.  Aside  from  the 
psychiatric  factors  which  determine  such 
differences  in  interpretation  in  MMPI 
items,  this  other  kind  of  difference  will 
also  operate  and  obscure  the  results.  It 
is  a  superficial  answer  to  say  that  how  a 
person  interprets  the  word  "several"  is 
a  reflection  of  his  personality,  since  it 


may  be  a  reflection  of  a  personality  trait 
which  is  minimally  or  even  negatively 
correlated  with  the  one  for  whose  sam- 
pling the  particular  item  was  statistically 
selected. 

It  might  be  advanced  against  the  fore- 
going considerations  that  the  nature  of 
the  statistical  methods  used  to  select 
items  is  such  as  to  systematically  exclude 
such  factors.  Naturally  the  relative  fre- 
quency of  occurrence  of  this  kind  of 
non-psychiatric  determiner  of  unusual 
interpretations  has  been  made  low  by 
the  setting  of  high  critical  scores  in  the 
neighborhood  of  two  standard  devia- 
tions in  a  "normal"  population  which 
by  definition  must  include  those  individ- 
uals who  are  psychiatrically  normal  but 
make  such  "misinterpretations"  when 
confronted  with  the  items.  Thus  it 
would  be  contended  that  all  of  those 
factors  which  are  psychiatrically  irrele- 
vant but  weight  the  probabilities  of  a 
given  item  response  have  been  taken 
care  of  in  the  setting  up  of  norms.  The 
statistical  cogency  of  this  argument  may 
be  fully  conceded  without  detracting  in 
any  way  from  the  value  of  investigating 
those  cases  who  belong  in  the  "missed" 
group  no  matter  where  the  critical  line 
is  drawn.  The  point  is  that  we  would 
prefer  to  redvxce  still  further  the  prob- 
abilities of  a  miss  in  either  direction, 
and  these  combined  probabilities  are  a 
function  of  the  total  overlap  of  groups 
as  well  as  of  the  arbitrary  critical  score 
set.  Thus,  if  you  want  to  "catch"  more 
of  the  actual  cases  of  schizophrenia,  you 
have  to  avoid  missing  some  of  those 
formerly  missed.  This  means  a  lowering 
of  the  critical  score,  and  immediately  a 
rise  in  the  frequency  of  normals  er- 
roneously called  schizophrenic.  You  have 
to  pay  for  an  improvement  in  one  direc- 
tion by  a  loss  in  another.  But  if  there 
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were  procedures  for  isolating  among  your 
"normals"  some  of  the  sub-grou})s  who 
achieve  relatively  deviant  scores  without 
being  trait-deviant,  these  sub-gr()uj)s 
coidd  then  be  s\stcinatically  iclentificd 
and  the  net  result  would  be  a  reduction 
of  errors  for  any  specified  critical  level 
chosen  in  practice. 

The  entire  problem  can  be  stated  suc- 
cinctly in  terms  of  tlic  probabilities  in- 
volved in  sampling  from  a  collective  of 
a  defined  character.  The  relative  fre- 
quency of  a  "plus"  response  for  any 
specified  item  is  determined  for  the  large 
and  heterogeneous  collective  called  "nor- 
mal" defined  by  their  not  being  in  a 
hospital  or  under  the  care  of  a  physician. 
If  a  person  belongs  to  this  collective, 
the  probability  of  his  saying  "Yes"  to 
an  item  indicating  abnormality  is  em- 
pirically known.  Combine  several  such 
items  scored  in  a  given  way,  and  the 
probability  of  a  member  of  the  given 
collective  saying  "Yes"  to  all  of  them  is 
small.  Combine  a  sufficiently  large  num- 
ber of  such  and  the  probability  of  his 
saving  "Yes"  to  all  of  them  may  become 
vanishingly  small.  For  some  other  group, 
e.g.,  the  set  of  all  hypochondriacs  so  diag- 
nosed, this  probability  is  empirically 
known  to  be  much  larger.  Now  among 
the  whole  set  of  "normals"  there  are 
various  sub-sets  whose  presence  requires 
us  to  raise  our  critical  score  in  order  to 
avoid  miscalling  them  hypochondriacs. 
Some  of  them  are  potential  and  semi- 
hypochondriacs,  but  not  all  of  them.  It 
is  in  these  sub-sets  of  non-hypochondria- 
cal  high-scoring  normals  that  we  are  in- 
terested. Their  high  scores  are  a  function 
of  some  variables  we  do  not  at  the  mo- 
ment measure,  and  if  we  could  isolate 
them  among  all  the  normals  we  would 
find  ourselves  with  a  little  sub-collective 
for  which  the  relative  frequency  of  the 


significant  responses  would  be  dillercnt 
from  normals  generally.  I'hat  they  are 
not  a  very  sizeable  minority  is  neces- 
sarily implied  by  the  very  definition  of 
a  "deviant"  score,  of  course;  but  they  are 
the  important  minority  who  produce  the 
test  misses. 

To  reiterate  at  this  point  that  "The 
statistical  frequencies  take  care  of  all 
that"  is  to  ignore  the  fact  that  sub-sets 
with  diverse  relative  frequencies  may  be 
combined  to  produce  a  total  collective 
in  which  the  relative  frequency  is  greater 
than  some  of  the  subsets  and  smaller 
than  others.  Any  procedure  for  the 
identification  of  the  members  of  such 
sub-sets  will  constitute  an  improvement 
in  the  measurement.  It  is  necessary  to 
realize  that  there  is  no  such  thing  as 
"the  probability"  in  the  empirical  sense 
of  frequencies.  If  probability  is  a  rela- 
tive frequency — and  in  this  type  of  anal- 
ysis it  cannot  reasonably  be  interpreted 
in  any  other  way — then  "the  probability" 
is  undefined  until  a  class  for  which  the 
frequencies  are  to  be  determined  is 
specified.  With  every  additional  fact  one 
specifies  a  narrower  sub-collective  for 
which  the  relative  frequencies  continu- 
ally oscillate  as  the  membership  is  re- 
duced. The  "true"  probability  is  not  to 
be  thought  of  as  unfortunately  unde- 
termined for  some  experimental  or  tech- 
nical reason  but  rather  as  being  simply 
undefined  until  the  limits  of  the  class 
are  specified.  Furthermore,  the  set  of 
such  probabilities  all  stand  on  the  same 
footing  and  cannot  be  discriminated 
among  as  to  which  is  "best."  If  an  in- 
surance actuary  wishes  to  define  the 
probability  that  I  will  die  in  the  next 
year,  he  may  begin  with  the  relative 
frequency  of  deaths  per  year  for  w^hite 
inarried  males  at  my  age.  If  we  add  the 
fact   that   I   am   of  Norwegian   ancestry 
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the  probability  will  change,  say  for  the 
better.  II  we  add  that  I  have  a  mitral 
regurgitation  it  changes  again,  for  the 
worse.  If  we  add  that  I  have  no  cardiac 
enlargement,  it  changes  again,  for  the 
better.  If  we  add  that  I  make  a  practice 
of  running  upstairs,  it  changes  again  for 
the  worse,  and  so  it  goes.  It  is  a  mistake 
to  assume  that  the  successive  values 
reached  by  such  a  procedure  correct  the 
probabilities  previously  arrived  at,  since 
the  latter  are  based  upon  frequencies 
for  larger  classes  and  hence  do  not  refer 
to  the  same  quantity.  We  have  a  set  of 
probabilities  each  of  which  is  correct 
for  the  class  to  which  it  refers,  and  the 
isolation  of  a  sub-set  for  the  class  for 
which  the  relative  frequency  is  different 
cannot  be  said  to  refute  the  former 
statement  in  any  meaningful  sense. 

It  may  be  asked  "What  probability 
should  be  chosen  in  practice?"  The  an- 
swer to  this  question  is  given  by  Reichen- 
bach  (29) — one  should  choose  the  most 
restricted  class  for  which  the  number  of 
cases  is  still  large  enough  to  yield  stable 
relative  frequencies.  To  quote  his  ex- 
ample for  the  simplest  case, 

"Imagine  a  class  A  within  which  an  event 
of  the  type  B  is  to  be  expected  with  the 
probability  1/2;  if  we  wager,  then,  always  on 
B,  we  get  50  per  cent  successes.  Now  imagine 
the  class  A  split  into  two  classes,  A^  and  A^; 
in  A^,  B  has  a  probability  of  1/4,  in  A^,  B 
has  a  probability  of  3/4.  We  shall  now  lay 
different  wagers  according  as  the  event  of 
the  type  B  belongs  to  A^  or  to  A^;  in  the  first 
case,  we  wager  always  on  non-B,  in  the 
second,  on  B.  "We  shall  then  have  75  per  cent 
successes." 

Although  this  example  is  taken  from  the 
probability  of  attributes,  the  principles 
apply  in  all  essentials  tf)  the  case  of 
continuous   variables. 

With  this  orientation  one  can  see  the 
so-called  "validity"  scales  of  the  MMPI 


in  a  new  light,  namely,  as  procedures  for 
the  detection  of  persons  who,  although 
they  belong  to  the  "normal"  collective, 
also  belong  to  sub-collectives  for  which 
the  relative  frequencies  of  certain  item 
responses  (and  hence  of  raw  scores)  are 
not  the  same  as  they  are  for  the  normal 
collective  as  a  whole.  If  only  4  per  cent 
of  unselected  "normals"  achieve  a  score 
above  two  standard  deviations  on  a  given 
scale,  we  shall  erroneously  segiegate  that 
4  per  cent  if  we  were  to  use  the  test  by 
itself  for  diagnosis.  If  among  that  4  per 
cent  are  some  who  yielded  deviant  scores 
because  they  did  not  understand  the 
English  language,  these  persons  may  be 
detected  by  another  measurement — the 
scale  F.  If  they  constitute  any  appreci- 
able proportion  of  this  4  percent,  the 
use  of  the  F-scale  becomes  profitable. 
Such  a  use  of  F  for  the  isolation  of  a 
sub-collective  with  atypical  frequencies 
among  the  normal  in  no  ways  contra- 
dicts the  overall  figure  of  4  per  cent 
which  remains  perfectly  correct  as  a 
frequency  obtained  in  the  appropriate 
(larger)  class. 

It  is  not  unthinkable  that  the  further 
development  of  clinically  useful  per- 
sonality inventories  of  the  question-an- 
swer type  may  hinge  in  considerable  part 
on  the  practical  application  of  this  line 
of  thought.  If,  as  now  appears  probable 
from  our  evidence,  part  of  the  discrimi- 
nating power  of  such  test  items  arises 
from  what  is  essentially  a  "projective" 
element  in  the  test-making,  we  shall  do 
well  to  retain  "ambiguities"  with  delib- 
erate intent.  But  there  are  probably 
few  verbal  ambiguities  which  cannot  be 
settled  projectively  by  the  testee  as  a 
function  of  several  different  factors.  Or- 
dinarily only  one  of  these  factors  is  of 
interest  to  us.  The  purpose  of  "validity 
scales '    and     'correction    scales"    is    to 
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partial  out,  so  to  speak,  the  influence  of 
the  others.  Horst  has  discussed  this  same 
problem  with  mathematical  rigor  under 
the  name  "suppression  variables"  and 
"clearing  variates,"  as  follo\\s  (italics 
mine): 

"From  the  iK)int  of  view  of  ihc  variables 
involved,  the  foregoing  trcatnieiit  suggests 
a  way  of  looking  at  the  problem  of  statistical 
prediction  which  has  not  been  previously 
emphasized.  The  conventional  way  has  been 
to  regard  the  prediction  problem  as  con- 
sisting of  two  sets  of  variables,  viz.,  the  vari- 
ables to  be  predicted,  that  is  the  dependent 
variables;  and  the  variables  from  which  pre- 
dictions are  made  or  the  independent  vari- 
ables. 

"Tlie  foregoing  analysis  suggests  the  utility 
of  regarding  the  independent  variables  as 
consisting  of  two  sets.  The  one  set  .should 
have  appreciable  correlations  ^\•ith  the  de- 
pendent variable.  The  other  set  should  have 
negligible  correlations  with  the  criterion  or 
dependent  variable  but  appreciable  correla- 
tions with  the  other  independent  variables. 
For  convenience,  we  may  designate  the  first 
set  of  independent  variables  as  the  prediction 
variables  and  the  second  set  the  suppression 


variables.    These    latter    have    been    called 
"clearing  variates"  by  Mendershausen, 

"From  a  common  sense  point  of  view,  it  is 
not  dillicult  to  explain  wliat  is  happening 
in  this  type  of  prediction  system.  In  most 
practical  prediction  problems,  it  is  im- 
possible to  find  useful  predictiou  variables 
u'hirh  do  not  liave  appreciable  components 
independent  of  the  criterion  variable,  so  that 
wc  arc  usually  predicting  part  of  the  criterion 
and  also  other  components  independent  of 
the  criterion.  What  we  need  in  this  case  is 
another  set  of  variables  independent  of  tlie 
criterion  but  correlated  with  those  compo- 
nents of  the  prediction  variables  which  are 
independent  of  the  criterion.  By  means  of 
the  second  set  we  should  be  able  to  suppress 
the  irrelevant  components  of  the  prediction 
variables."  {i6,  p.  434). 

The  present  investigation  will  be  seen 
from  the  evidence  presented  to  consist 
essentially  in  the  empirical  isolation  of 
a  set  of  items  on  the  MMPI  which  act 
in  the  role  of  suppression  variables  for 
at  least  some  of  the  abnormal  compo- 
nents, although  the  behaviorial  natme  of 
the  components  thus  suppressed  remains 
largely  mysterious. 


Chapter  III 
Derivation  of  the  N-Scale 


PLAN  OF  investigation 

IN  ACCORDANCE  witli  the  usual  empiri- 
cal procedure  of  scale  derivation 
which  has  been  found  fruitful  in  the 
development  of  the  MMPI,  the  follow- 
ing plan  of  study  was  employed  in  the 
present  investigation: 

1.  A  group  of  cases  was  selected  by  the 
fact  that  the  profiles  showed  deviations 
that  would  ordinarily  be  labeled  "ab- 
normal," in  spite  of  the  apparent 
freedom  of  the  subjects  from  incapacita- 
ting psychiatric  involvement.  These 
cases  will  be  referred  to  hereinafter  by 
the  phrase  "criterion  normals." 

2.  A  second  group  of  cases  was  selected 
so  that  their  profiles  would  match  as 
closely  as  was  practicable  the  profiles 
(case  by  case)  of  the  first  group,  but  this 
second  group  was  drawn  from  the  hos- 
pital population  and  thus  was  composed 
entirely  of  individuals  whose  psychiatric 
involvement  was  such  as  to  result  in 
their  being  in  a  psychopathic  unit.  This 
group  is  referred  to  hereinafter  as  the 
"criterion  abnormals." 

3.  An  item  analysis  was  carried  out 
on  all  495  items^  of  the  MMPI  and  those 
items  which  showed  a  statistically  signif- 
icant differentiation  between  the  two 
groups  were  brougiit  together  on  one 
key  to  form  a  scale. 

4.  The  central  tendency  and  variabil- 
ity of  the  resulting  scale  was  obtained 
for  a  random  group  of  "normal"  per- 
sons, and  standard  scores  calculated  for 
further  easy  reference. 

5.  The  differentiating  properties  of 
this  scale  were  studied  by  systematic  ap- 

"  Only  495  items  were  involved  because  of  the 
original  504  items  available  on  old  forms,  9  have 
been  eliminated  in  the  present  box. 


plication  to  various  special  groups,  such 
as  the  original  criterion  groups,  "test" 
normals  and  abnormals  from  diverse 
sources,  college  students,  WPA  workers, 
persons  of  different  chronological  ages, 
and  so  on. 

6.  The  internal  consistency  and  test- 
retest  stability  of  the  scale  were  deter- 
mined by  the  usual  procedures. 

7.  The  intercorrclation  between  tlic 
scale  and  the  other  scales  of  the  MMPI 
were  calculated  for  both  normal  and  ab- 
normal populations. 

8.  A  minor  experiment  in  the  selec- 
tion of  normal  from  abnormal  profiles 
with  and  without  the  scale  in  question 
was  performed. 

g.  From  an  analysis  of  the  preceding 
findings  plus  an  inspection  of  the  items 
on  the  scale,  tentative  interpretive  pos- 
sibilities are  suggested  and  possible  ex- 
perimentation for  this  purpose  pro- 
posed. 

the  "criterion  normals" 

The  first  step  was  the  selection  of  a 
group  of  cases  showing  abnormal  pro- 
files by  usual  standards  but  not  under 
psychiatric  care.  As  is  the  case  with  all 
work  on  the  MMPI  it  was  necessary  to 
accept  a  very  crude  and  superficial  de- 
finition of  "normality."  Ideally,  one 
would  prefer  to  be  able  to  have  intimate 
psychiatric  knowledge  concerning  the 
behavior  and  history  of  tlie  persons  so 
designated,  but  this  is  obviously  impos- 
sible for  practical  reasons  if  it  is  desired 
to  have  a  large  and  representative 
enough  group  to  carry  out  an  item 
analysis.  Furthermore,  it  is  necessary  to 
avoid  equating  "normal"  to  "well-ad- 
justed" in  some  optimal  meaning  of  the 
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latter,  since  what  is  of  clinical  import- 
ance is  in  a  fundamental  sense  this  very 
crude  brute  fact  of  having  managed  to 
stay  out  of  the  hands  of  a  psychiatrist. 
Thirdly,  it  is  desirable  in  view  of  the 
previous  era  of  overoptimism  regarding 
the  power  of  personality  tests  to  define 
the  situation  so  as  to  make  it  as  "hard" 
for  the  test  as  possible,  an  end  which  is 
certainly  facilitated  by  the  blanket  as- 
sumption that  all  persons  who  are  not 
under  a  doctor's  care  or  in  a  psycho- 
pathic ward  are  by  that  token  more 
"normal"  than  all  of  those  who  are.  In 
the  following  discussion  "normal"  means 
that  the  profile  is  that  of  a  person  who 
at  the  time  of  examination  was  not 
under  a  doctor's  care. 

The  entire  collection  of  "normal" 
cases  (N  =  691)  in  the  records  was  can- 
vassed in  order  to  locate  all  profiles 
showing  any  of  the  eight  components 
(Mf  has  been  ignored  throughout  the 
present  study)  with  a  T-score  equal  to 
or  exceeding  70.-  These  691  normal  cases 
make  up  the  standardization  group  upon 
which  the  other  scales  have  been  de- 
veloped, and  their  composition  has  been 
previously  described  (9,   10,   ii,   12,   13). 

Since  it  was  a  priori  doubtful  whether 
the  suppression  variables  for  all  eight  of 
the  abnormal  components  would  turn 
out  to  be  the  same,  the  original  item 
analysis  was  confined  to  those  normals 
from  this  group  who  showed  elevations 
(T  ^  70)  on  any  of  the  three  scales  of 
the  so-called  "neurotic  triad"  (Hs,  D, 
or  Hy).  In  the  event  that  the  same  sup- 
pression variables  might  operate  in  the 
case  of  other  scales,  this  fact  would  be 
subsequently    elicited;    whereas    if   such 

*A11  MMPI  scales  are  expressed  as  T-scores, 

X  — X 

where  T  z=  50  -f-  10 
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were  not  the  case  the  confounding  of 
factors  due  to  pooling  several  varieties 
of  deviant-scoring  normals  might  result 
in  a  failure  to  locate  discriminating 
items.  The  first  group  studied,  therefore, 
consisted  of  persons  scoring  T  ^  70  on 
at  least  one  (and  possibly  more)  of  these 
three  scales,  but  no  systematic  effort  was 
made  to  avoid  cases  in  which  some  other 
scale  was  also  elevated.  The  exclusion 
of  all  but  "pure"  neurotic  triad  profiles 
woidd  have  reduced  the  number  avail- 
able for  study  beyond  a  workable  limit. 

From  this  preliminary  pool  of  deviant 
scoring  normals  a  smaller  group  was 
selected  by  tlie  application  of  certain 
important  restrictions  regarding  the 
probable  validity  of  the  profiles.  Be- 
cause it  was  clear  already  that  the  num- 
ber of  cases  for  analysis  would  be  quite 
small,  the  restrictions  regarding  prob- 
able validity  were  made  fairly  rigorous. 
In  addition  it  was  considered  that  to 
loosen  these  restrictions  might  result  in 
a  mere  duplication  of  the  F-scale,  a  sup- 
pression variable  already  extant.  Ac- 
cordingly the  following  two  validity  re- 
strictions were  imposed  in  the  selection 
of  a  smaller  set  from  the  group  consisting 
of  all  normals  showing  any  T  ^  70: 

1.  In  order  to  make  practically  certain 
that  no  case  was  included  in  which  the 
elevation  of  profile  was  due  to  careless- 
ness or  lack  of  understanding,  it  was  re- 
quired that  the  F-score  of  any  case  ac- 
cepted should  not  exceed  T  =  60.  It 
was  recognized  that  this  would  exclude 
an  unknown  proportion  of  individuals 
whose  F  was  more  elevated  not  because 
of  non-cooperation  or  misunderstanding 
but  because  of  validly  "unusual"  re- 
sponses reflecting  the  same  personality 
trends  as  the  elevated  profile  suggested. 
But  for  the  above  reasons  it  was  felt 
preferable  to  lose  this  group  than  to  in- 
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elude  cases  where  cooperation  or  under- 
standing was  doubtful  or  in  which  the 
subject  had  become  careless  toward  the 
end  of  the  testing  period. 

2.  In  order  to  avoid  an  excessive  fre- 
quency of  "Cannot  say"  responses  in  do- 
ing the  item  analysis  on  so  few  cases,  as 
well  as  to  exclude  cases  of  doubtful 
validity  indicated  by  such  an  excess,  no 
case  accepted  showed  a  "?"  score  of 
T>64. 

No  restrictions  were  placed  upon  L 
(lie)  because  the  error  introduced  by  in- 
cluding persons  who  consciously  or  un- 
consciously put  themselves  in  a  favorable 
light  would  be  opposite  to  the  error  one 
wished  most  to  avoid— that  of  including 
cases  in  which  the  deviant  profile  was 
spuriously  Jiigh  due  to  some  factor  or 
other. 

The  imposition  of  these  restrictions 
reduced  the  total  number  of  cases  for 
study  to  only  42,  of  which  17  were  males 
and  25  were  females.  No  effort  was  made 
to  make  the  sample  "representative" 
with  regard  to  such  variables  as  age,  sex, 
or  socio-economic  status,  since  this 
would  have  further  reduced  its  size.  The 
ages  ranged  from  20  to  45  with  a  mean 
age  of  32.9  years  (32.2  for  the  males  and 
33.3  for  the  females).  Except  for  such 
minimal  comments  as  were  written  on 
the  record  sheet  attached  to  each  test,  no 
further  material  was  available  regarding 
these  persons.  With  the  possible  excep- 
tion of  a  self  rating  by  the  subject  as 
to  whether  he  considered  himself  "nerv- 
ous" or  not,  this  additional  information 
was  of  no  value.  Since  this  latter  self 
rating  is  essentially  a  condensed  variant 
of  a  great  mass  of  such  self  judgments 
contained  in  the  test  itself  it  also  is  of 
little  value,  although  it  is  worth  noting 
that  18  of  the  42  cases  stated  that  they 
considered  themselves  to  be  "nervous." 


Tablf,  I 

Mean  T-Scores  on  All  Scales  for  the  42 
Criterion  Normals 


Both  Sexes 

Males 

Females 

(N  =  42) 
52.0 

(N  =  i7) 

(N  =  2s) 

? 

52.3 

Si-7 

L 

53  0 

515 

54-0 

F 

53-0 

52.9 

53-0 

Hs 

66.1 

66.9 

65-5 

D 

66.8 

66.1 

67-3 

Hv 

64. 1 

63.3 

64.6 

Pd 

56. 5 

55-2 

57-4 

Pa 

55-4 

55-6 

55-3 

Pt 

60.6 

60.6 

60.6 

Sc 

56.4 

55-4 

57-1 

Ma 

48.5 

47-5 

49.2 

The  mean  T-scores  for  these  42  cases 
as  well  as  for  the  two  sexes  separately 
are  presented  in  Table   1. 

It  will  be  noted  that  although  every 
one  of  these  cases  was  selected  for  show- 
ing either  Hs,  D,  or  Hy  with  T  ^  70, 
the  mean  profile  is  within  the  limit 
usually  called  "normal,"  since  individ- 
uals do  not  tend  to  exceed  70  on  all  three 
in  most  cases.  Twelve  cases  show  scores 
^  80,  the  other  30  scoring  between  70 
and  79.  Although  the  profiles  were 
selected  on  the  basis  of  the  neurotic 
triad  and  apparent  validity  only,  one  no- 
tices that  with  the  exception  of  Ma  for 
males,  all  scales  show  a  mean  above  the 
population  average  50,  and  the  general 
profile  pattern  is  qin'te  similar  for  the 
two  sexes.  The  three  validity  indicators 
arc  very  comfortably  low. 

We  have  here,  then,  a  group  of  42 
persons  who  were  cooperative  and  under- 
stood their  task  at  the  time  they  took  the 
inventory;'  who  score  two  or  more  stand- 
ard deviations  above  the  mean  of  the 
general  population  on  one  or  more  of 
the  eight  abnormal  components;  and 
who  were  not  under  the  care  of  a  physi- 

•  From  now  on  the  term  nalid  as  applied  to 
profiles  will  he  used  in  this  restricted  sense,  and 
has  no  connotation  of  "agreement  with  other 
criteria." 
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cian  at  the  time.  While  it  is  true  that 
these  are  merely  the  deviates  of  the  "nor- 
mal" population  on  which  the  T-scores 
were  deri\ed,  the  important  thing  to 
keep  in  mind  is  that  persons  whose  pro- 
files are  no  more  deviant  than  these  can 
be  found  in  the  hosjMtal  Avitli  an  in- 
capacitating psychiatric  involvement.  It 
is  in  comparison  with  these  latter  i)er- 
sons  that  the  present  group  of  "normal 
deviates"  are  of  interest.  The  dichotomy 
"normal"  and  "abnormal"  may  be  arti- 
ficially made  sharp  by  the  definition  in 
terms  of  "in  or  out  of  a  hospital."  The 
dividing  line  in  the  case  of  the  meas- 
ured (scale)  component  is  even  more  ar- 
bitrary, and  although  the  problem  has 
been  stated  in  terms  of  the  critical  score 
70,  it  is  clear  that  if  there  were  no  "ab- 
normals"  with  scores  as  low  as  this  the 
occinrence  of  a  small  group  of  scores 
among  normals  which  exceed  this  criti- 
cal line  would  merely  exemplify  a  statis- 
tical truism  and  the  group  involved 
would  have  little  practical  interest  for 
us.  As  long  as  the  analysis  was  ultimately 
to  be  carried  through  on  matched  groups 
in  any  event,  one  could  probably  have 
chosen  some  other  arbitrary  value  as 
critical  for  selection  of  profiles,  either 
higher  or  lower  than  70. 

THE    "criterion   ABNORMALS" 

The  42  profiles  obtained  from  the 
criterion  normals  were  then  considered 
one  at  a  time  wuth  reference  to  the  pro- 
files of  the  whole  available  population 
of  hospital  cases.  Each  "normal"  profile 
was  matched  as  closely  as  possible  for 
all  8  components,  as  to  sex,  and  as  to 
age  (always  within  10  years).  In  spite  of 
the  laborious  canvassing  of  over  400  ab- 
normals  which  was  repeated  with  each 
of  the  42  criterion  normals,  it  was  of 
course    impossible    to    achieve    an    ex- 


tremely close  match  to  each  criterion 
normal  when  eight  variables  in  addition 
to  age  and  sex  were  involved.  Most  at- 
tention was  paid  to  the  three  scales  of 
the  nemotic  triad,  btu  all  eight  scales 
A\ere  taken  into  accoinit  to  some  extent. 
Organic  cases  were  excluded. 

As  regards  the  validity  indicators,  it 
was  necessary  in  the  case  of  the  criterion 
abnormals  to  eliminate  "lying"  or  at 
least  that  aspect  or  type  of  lying  which 
is  detected  already  by  the  L  scale.  For 
this  reason  no  case  was  included  in  the 
criterion  abnormal  group  if  its  L-score 
exceeded  60,  with  one  exception  which 
was  imavoidable  if  any  reasonable 
matching  was  to  be  made  on  the  other 
variables.  The  "?"  was  not  allowed  to 
rise  above  56,  nor  the  F  above  62.  (These 
limits  were  not  chosen  arbitrarily  to  be- 
gin with,  but  are  the  upper  limits  finally 
obtained  when  matching  was  completed 
with  the  importance  of  these  three  vari- 
ables being  kept  in  mind  during  the 
matching  process.) 

As  to  clinical  diagnosis,  all  17  of  the 
male  matched  abnormals  were  diagnosed 
psychoneurosis,  as  were  all  but  three  of 
the  25  female  matched  abnormals.  These 
three  cases  were  diagnosed  manic-depres- 
sive manic  (questionable  diagnosis, 
atypical  case);  psychopathic  personality 
with  hysteria  and  anxiety;  and  paranoid 
condition  with  recurrent  depression. 

The  ages  of  these  criterion  abnormals 
range  from  19  to  47  with  a  mean  age  of 
32.3  years  (31.9  for  men  and  32.5  for 
women). 

The  mean  T-scores  for  the  42  criterion 
abnormals  as  well  as  by  sexes  separately 
are  presented  in  Table  2,  together  with 
the  corresponding  scores  for  the  42  cri- 
terion normals  with  which  they  were 
matched. 

These    results    are    graphically    repre- 
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Table  2 

Mean  T-Scores  for  the  42  Criterion  Abnormals, 
Sexes    Separately    and    Pooled,    Compared 
with  Those  of  the  42  Criterion  Normals 
with  Whom  They  Were  Matched 


42  Cri- 

42 Cri- 

17 

25 

terion 

terion 

Male 

Female 

Nor- 

Abnor- 

Abnor- 

Abnor- 

mals 

mals 

mals 

mals 

? 

52.0 

50-7 

50.2 

Si-i 

L 

53-0 

52 

9 

52.4 

53-3 

F 

53-0 

51 

8 

52.0 

51.6 

Hs 

66.1 

62 

Q 

61 .4 

63 -9 

D 

66.8 

68 

.3 

69.9 

67.  2 

Hv 

64. 1 

65 

9 

63 -4 

67.5 

Pd 

56.5 

5.S 

7 

52.5 

54-5 

Pa 

55-4 

55 

7 

56.1 

55-4 

Pt 

60.6 

55 

5 

56.1 

55-0 

Sc 

56.4 

52 

9 

53-5 

52.5 

Ma 

48.5 

51 

3 

52.4 

50.6 

sentcd  in  the  accompanying  profiles. 
(Fig.  1)  It  will  be  noted  that  the  most 
marked  difference  in  both  sexes  occurs 
in  the  case  of  Pt  (psychasthenia),  in 
which  the  normals  are  about  one-half  a 
sigma  above  the  abnormals.  This  differ- 
ence may  be  partly  the  reason  for  the 
correlation  later  found  between  the 
derived  N  scale  and  Pt.  On  the  other 


hand,  it  is  difficult  to  see  how  such  a 
systematic  trend  arose  as  a  result  of  the 
matching  process  unless  there  were  some 
factor  in  or  associated  with  the  Pt  scale 
which  contributes  to  "normality"  in  the 
presence  of  elevated  scores  on  the  neu- 
rotic triad. 

Of  the  eight  differences  between 
means  of  the  criterion  groups,  three  are 
statistically  significant  at  the  5  per  cent 
level.  Pt  is  significantly  higher  for  the 
normals  (P  <  .001),  Pd  is  significantly 
higher  for  the  normals  (P  <  .01),  and  Sc 
is  significantly  higher  for  the  normals 
(P  <  .01).  It  is  worth  noting  although 
inexplicable  that  in  all  three  compari- 
sons the  criterion  normals  have  the  more 
"abnormal"  (elevated)  score. 

The  differences  show  that  the  match- 
ing procedure,  in  spite  of  the  laborious 
profile-by-profile  technique  employed, 
was  not  completely  successful  in  elimi- 
nating systematic  differences  between  the 
two  groups.  The  matching  was  as  good 
as  could  be  reasonably  expected  under 
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Fig.   1.  Mean  profiles  of  the  42  criterion  normals  and  the  42  criterion  abnormals 

with  whom  ihcy  were  matched. 
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these  circumstances,  and  the  differences, 
while  statistically  significant  in  three  in- 
stances at  least,  are  not  in  the  direction 
which  would  be  serious  for  the  present 
investigation.  Correlations  subsequently 
found  Avith  other  scales  or  random  sam- 
ples of  normals  and  abnormals  are  made 
ambiguous  to  interpret  to  some  extent 
because  of  these  differences  in  the  orig- 
inal groups  used  in  item  selection. 
Nevertheless  it  must  be  clear  that  the 
correlations  cannot  be  siunmarily  dis- 
missed as  being  due  to  this  matching 
factor,  since  it  is  necessary  to  explain  the 
systematic  matching  difference  which 
can  hardlv  have  been  due  to  any  bias 
on  the  part  of  the  matcher.  A  "correla- 
tion" of  \ariables  in  the  supply  might 
residt  in  such  systematic  differences  in 
matching  equallv  easilv  as  the  converse. 

THE    ITEM    ANALYSIS 

"We  nt)w  have  two  groups  of  valid  in- 
dividual profiles,  one  being  from  a  set  of 
persons  who  exhibit  "deviant"  MMPI 
profiles  in  terms  of  the  arbitrary  critical 
score  70  but  are  not  in  a  psychiatric 
ward  or  under  a  physician's  care  ("cri- 
terion normals");  the  other  from  a  set 
of  persons  with  valid  and  no  more  devi- 
ant profiles  than  the  first  group  but  in 
the  psychopathic  unit  because  they  have 
not  been  able  to  make  an  adjustment 
outside  ("criterion  abnormals").  The 
next  step  is  to  determine  whether  there 
are  any  items  which  show  a  significant 
differentiation  between  these  groups.  If 
any  such  normality  or  suppression  vari- 
able as  has  been  provisionally  hypothe- 
sized should  actually  exist,  it  seems  a 
priori  likely  that  at  least  some  among  the 
495  items  of  verbal  behavior  might  be 
at  least  in  part  a  function  of  this  vari- 
able. It  is  not  to  be  expected  that  a 
scale  composed  of  items  so  selected  will 


be  either  behaviorally  or  mathematic- 
ally homogeneous.  At  the  present  stage 
of  analysis  it  would  be  foolish  to  insist 
that  it  should  be.  More  refined  statistical 
procedures  can  be  applied  subsequently 
if  the  end  of  factorial  purification  is 
deemed  worthy,  although  it  is  not  a  self- 
evident  proposition  that  such  is  to  be 
desired  in  any  case.  Any  "homogeneity" 
of  the  first  scale  derived  by  this  method 
consists  precisely  in  tlic  method  of  its 
derivation,  namely,  that  it  tends  to  dis- 
tinguish persons  who  get  abnormal 
scores  and  are  abnormal  from  those  who 
get  equally  abnormal  scores  but  are  nor- 
mal. 

In  scoring  the  MMPI  a  "significant" 
response  to  any  given  item  is  defined  as 
that  response  (whether  "Yes"  or  "No") 
which  is  made  by  a  minority  of  the 
standardization  group.  These  responses 
are  indicated  with  a  red  plus  (  +  )  on  the 
answer  sheet  and  are  also  commonly  re- 
ferred to  as  "plus"  responses.  In  what 
follows  the  terms  significant  and  plus 
will  be  used  hereafter  in  this  special 
sense,  whereas  the  term  "abnormal"  will 
be  used  to  designate  the  response  char- 
acteristic of  the  criterion  abnormals  in 
the  present  study.  As  will  be  seen  later, 
the  two  indications  are  not  in  perfect 
agreement  by  any  means,  rather  the 
reverse  being  true. 

After  tallying  and  summing  the  "sig- 
nificant" responses  for  each  of  the  495 
items  for  both  criterion  groups,  these 
sums  were  then  converted  to  propor- 
tions, giving  the  per  cent  significant  res- 
ponses to  each  item  by  the  criterion 
normals  and  bv  the  criterion  abnormals. 
All  items  for  which  the  difference  in 
proportion  of  significant  responses  be- 
tween normals  and  abnormals  reached 
or  exceeded  .20,  which  is  approximately 
at  the  5  per  cent  level  of  significance  in 
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samples  of  42  cases,  were  examined  fur-  items    which    show    "statistical"    signifi- 
ther  by  more  exact  tests.  In  a  few  cases  cance   when   they   are  answered   in   the 
items  in  which  the  proportions  were  very  same  direction  by  practically  every  mem- 
close  to  .00  or  1.00  were  further  investi-  her  of  both  populations,  for  in  such  in 
gated  even  though  tlic  differences  were  stances   the   "significance"   arises  mereh 
less    than    .20    because    of    the    greater  from  the  deviant  values  of  the  propoi 


statistical  significance  of  differences 
among  proportions  approximating  these 
limits. 

The  preliminary  survey  yielded  103 
items  which  showed  differences  in  the 
neighborhood  of  .20  between  proportion 
of  plus  responses  for  the  two  criterion 
groups.  The  significance  of  the  differ- 
ences was  then  determined  more  exactly 
by  the  t-test  for  significance  of  differ- 
ence of  two  proportions,  using  the  for- 
mula: 


tions  and  the  effective  discriminating 
power  of  such  an  item  when  included  in 
a  scale  is  hardly  worth  the  effort  of  scor- 
ing it. 

An  examination  of  the  differences  be- 
tween proportions  for  those  items  show- 
ing p  near  .00  or  1.00  using  the  normal- 
ized C.R.  of  Zubin's  nomographs  (34) 
shows  that  not  more  than  six  items  would 
have  been  included  had  this  more  liberal 
test  been  employed. 

On   applying  the  criterion  of  signifi- 

Pi  -  P2 


t  = 


/N,p, 


+  N2P2  / 

V 


Nipi  +  N2P2 


/\Ni      N2  / 


+  N2     \  Ni  -f  No 

All  items  for  wliich  the  value  of  t  from  cance  to  the  original  set  of  103  items,  a 

this  formula  equaled  or  exceeded   1.96  smaller  group  of  78  items  was  finally  se- 
wcre  considered  significantly  differential 


mg. 

It  is  true  that  this  formula,  while  an 
improvement  over  the  older  formula  for 
the  standard  error  of  a  difference  of  two 
proportions,  still  involves  approxima- 
tions when  it  is  applied  to  proportions 
in  the  neighborhood  of  .00  and  1.00,  for 
which  the  sampling  distributions  are  so 


lected  which  met  this  criterion.  Of  these 
78  items,  9  show  differences  3  times  or 
more  their  standard  error;  62  show  dif- 
ferences between  2  and  3  times  their 
standard  errors;  and  the  remaining  7 
items  show  differences  between  1.95  and 
2.00  times  their  standard  errors  (just 
reaching  the  5  per  cent  level).  Of  the  7 
items  with  a  C.R.  between  1.95  and  2.00, 


markedly  skewed  except  for  very  large      4  are  significant  for  one  sex  but  not  for 


values  of  N.  However,  here  also  the  error 
introduced  is  in  a  harmless  direction, 
inasmuch  as  for  such  very  deviant  values 
of  p  the  above  formula  for  testing  signifi- 
cance tends  to  underestimate  the  real  sig- 
nificance of  a  difference,  so  that  one  is 
much  more  likely  in  such  cases  to  miss  a 
significant  item  than  erroneously  to  seg- 
regate an  actually  non-differentiating 
one.    In   the   case   at   hand,   we  are   not 


the  other.  But  in  these  cases  the  differ- 
ences are  in  the  same  direction  for  both 
sexes  and  not  less  than  .12  in  any  case. 
This  set  of  78  items,  scored  as  indicated 
in  the  next  section,  constitutes  the  scale 
hereinafter  referred  to  as  N. 

The  initial  "N"  was  originally  chosen 
as  a  reference  to  "normal,"  having  in 
mind  the  theory  of  Rosanoff  previously 
mentioned  and  the  analogous  scale  called 


especially    desirous   of   locating   all    the      .V  by  Humm  and  ^Vadsworth.  It  seems 
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rather  clear  from  what  follows  later  that 
this  N  is  "normality"  only  in  the  sense 
of  a  suppression  variable  or  correction 
scale,  and  not  in  the  original  sense  of  a 
positive  personality  variable  of  inhibitory 
or  "controlling"  natme. 

The  78  items  on  the  N  scale  are  as  fol- 
lows, the  letter  T  or  F  indicating  the  di- 
rection in  which  the  criterion  7wri}7als 
tend  to  respond  as  contrasted  with  the 
criterion  abnormals.  In  other  words, 
those  persons  who  show  deviant  profiles 
on  the  other  MM  PI  scales  but  have  re- 
mained in  the  comminiity  out  of  psychi- 
atric hands,  have  a  greater  tendency  to 
give  the  indicated  response  than  those  of 
similar  profile  but  in  the  hospital  with  a 
psychiatric  distinbance. 

CONTENT  OF  ITEMS  ON  N  SCALE 

(These  items  are  listed  according  to  their 
numbers  as  given  in  the  back  of  the  revised 
manual  of  directions,  1943.  Indicated  answer 
is  more  characteristic  of  the  criterion  nor- 
mals.) 
A-15     I   am   troubled   by  attacks  of  nausea 

and  vomiting.     (F) 
A-20     I  have  had  attacks  in  which  I  could 

not  control  my  movements  or  speech 

but  knew  what  was  going  on  around 

me.     (F) 
A-30     Peculiar  odors  come  to  me  at  times. 

(T) 
A-32     I  can  read  a  long  Avhile  without  tiring 

my  eyes.     (F) 
A-39     I  have  no  trouble  swallowing.     (F) 
B-9       I  am  almost  never  bothered  by  pains 

over  the  heart  or  in  my  chest.     (F) 
B-51     I   have   been   quite   independent   and 

free   from   family  rule.     (F) 
C-7       My  relatives  are  nearly  all  in  sympathy 

with  me.     (F) 
C-21     I  hate  to  have  to  rush  when  working. 

(T) 
C-25     I   have  often   lost   out   on   things  be- 
cause I  could  not  make  up  my  mind 

soon  enough.     (T) 
C-33     It  takes  a  lot  of  argument  to  convince 

most  people  of  the  truth.     (T) 
C-34     When   I   am   feeling  very  happy  and 


C-35 
C-46 

D-9 
D-13 


D-15 
D-17 

D-18 


D-20 


D-52 


E-11 


E-13 
E-15 


E-24 


E-27 


E-28 
E-49 

E-51 

E-52 

F-4 

F-6 

F-7 
F-8 


F-13 

F-17 
F-28 


active,  someone  who  is  blue  or  low 
will  spoil  it  all.     (T) 

I  liked  school.     (F) 

The  only  interesting  part  of  the  news- 
papers is  tlic  funnies.     (T) 

Religion  gives  mc  no  worry.     (F) 

I  feel  sure  there  is  only  one  true  re- 
ligion.    (T) 

I  believe  there  is  a  God.     (T) 

I  believe  in  a  life  hereafter.     (T) 

I  believe  in  the  second  coming  of 
Christ.     (T) 

The  only  miracles  I  know  of  are 
simply  tricks  that  people  play  on 
one  another.     (F) 

I  think  most  people  would  lie  to  get 
ahead.     (T) 

At  times  I  have  been  so  entertained 
by  the  cleverness  of  a  crook  that  I 
have  hoped  he  would  get  by  with 
it.     (F) 

Policemen   are   usually  honest.     (F) 

It  wouldn't  make  me  nervous  if  any 
members  of  my  family  got  into 
trouble  with  the  law.     (F) 

I  prefer  to  pass  by  school  friends,  or 
people  I  know  but  have  not  seen  for 
a  long  time,  unless  they  speak  to  me 
first.     (T) 

At  parties  I  am  more  likely  to  sit  by 
myself  or  with  just  one  person  than 
to  join  in  with  the  crowd.     (T) 

I  love  to  go  to  dances.     (F) 

It  does  not  bother  me  that  I  am  not 
better  looking.     (F) 

I  do  not  like  to  see  women  smoke. 
(T) 

People  often  disappoint  me.     (T) 

I  am  easily  embarrassed.     (T) 

I  am  not  unusually  self-conscious.     (F) 

What  others  think  of  me  does  not 
bother  me.     (F) 

It  makes  me  uncomfortable  to  put  on 
a  stunt  at  a  party  even  when  others 
are  doing  the  same  sort  of  things. 
(T) 

I  have  sometimes  stayed  away  from 
another  person  because  I  feared 
doing  or  saying  something  that  I 
might  regret  afterwards.     (T) 

I  feel  unable  to  tell  anyone  all  about 
myself.     (T) 

People  have  often  misunderstood  my 
intention  when  I  was  trying  to  put 


ii 
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them  right  and  be  helpful.     (T) 
F-31     I  easily  become  impatient  with  people. 

(T) 
F-34     Criticism  or  scolding  hurts  me  terribly. 

(T) 
F-47     Often,  even  though  everything  is  going 

fine  for  me,  I  feel  that  I  don't  care 

about    anything.     (T) 
F-48     I  very  seldom  have  spells  of  the  blues. 

(F) 

F-53  I  have  often  felt  badly  over  being  mis- 
understood when  trying  to  keep 
someone  from  making  a  mistake. 
(T) 

G-i  I  feel  anxiety  about  something  or 
someone  almost  all   the   time.     (T) 

G-3       I  believe  my  sins  are  unpardonable. 

(F) 
G-14     I    tend    to    be    interested    in    several 

different   hobbies  rather   than   stick 

to  one  of  them  for  a  long  time.     (T) 
G-17     I  like  to  keep  people  guessing  what 

I'm  going  to  do  next.     (T) 
G-27     I  am  often  said  to  be  hot  headed.     (T) 
G-28     I   am   not   easily  angered.     (F) 
G-29     I  get  mad  easily  and  then  get  over  it 

soon.     (T) 
G-31     At  times  I  feel  like  smashing  things. 

(T) 
G-32     At  times  I  feel  like  picking  a  fist  fight 

with  someone.     (T) 
G-33     I    have    the    wanderlust    and    I    am 

never  happy  unless   I   am  roaming 

or  traveling  about.     (T) 
G-38     When  I  leave  home  I  do  not  worry 

about  whether   the  door  is  locked 

and  the  windows  closed.     (F) 
G-40     In  walking  I  am  very  careful  to  step 

over  sidewalk  cracks.     (T) 
G-45     Often  I  cross  the  street  in  order  not  to 

meet  someone  I  see.     (T) 
G-49     When  someone  says  silly  or  ignorant 

tilings    about    something    I    know 

about,  I  try  to  set  him  right.     (F) 
H-2       I   have   often   thought   that  strangers 

were  looking  at  me  critically.     (T) 
H-18     If  given   a   chance   I   could  do  some 

things  that  would  be  of  great  bene- 
fit to  the  world.     (T) 
H-28     I  get  anxious  and  upset  when  I  have 

to    make    a    short    trip    away    from 

home.     (T) 
H-34     Lightning  is  one  of  my  fears.     (T) 


H-35     A  windstorm   terrifies  me.     (T) 

H-36     I  am  not  afraid  of  fire.     (F) 

H-41     I  have  no  fear  of  water.     (F) 

H-43  I  do  not  worry  about  catching  dis- 
eases.    (F) 

H-46  I  am  afraid  to  be  alone  in  the  dark. 
(T) 

H-47  I  am  afraid  when  I  look  doAvn  from 
a  high  place.     (T) 

H-52  I  have  no  fear  of  going  into  a  room 
by  myself  where  other  people  have 
gathered  and  arc  talking.     (F) 

I-5  Horses  that  don't  pull  should  be  beaten 
or  kicked.     (T) 

I- 10  The  future  is  too  uncertain  for  a  per- 
son to  make  serious  plans.     (T) 

I-ii  It  is  great  to  be  living  in  these  times 
when  so  much  is  going  on.     (F) 

I- 14  I  frequently  ask  people  for  advice. 
(T) 

I-15  My  plans  have  frequently  seemed  so 
full  of  difficulties  that  I  have  had  to 
give  them  up.     (T) 

I-17  I  wish  I  could  get  over  worrying  about 
things  I  have  said  that  may  have 
injured  other  people's  feelings.     (T) 

I- 18  I  often  must  sleep  over  a  matter  before 
I  decide  what  to  do.     (T) 

I-30  My  ^vay  of  doing  things  is  apt  to  be 
misunderstood  by  others.     (T) 

I-38  I  often  think,  "I  -wish  I  were  a  child 
again."     (T) 

I-40     I  am  entirely  self-confident.     (F) 

I-41  Once  in  a  while  I  think  of  things  too 
bad  to  talk  about.     (T) 

Of  the  whole  group  of  78  items,  it 
should  be  noticed  that  by  far  the  majority 
of  them  are  answered  in  the  "significant" 
or  "plus"  direction  by  the  criterion  nor- 
mals more  than  by  the  criterion  abnor- 
mal. There  are  only  14  exceptions  to 
this  trend,  namely,  items  A-15,  A-20, 
C-21,  D-15,  D-17,  D-18,  E-ii,  E-15,  E-51, 
G-3,  G-14,  ^'5'  ^'^4'  J-4^-  Furthermore,  it 
is  evident  that  in  the  great  majority  of 
the  items  the  response  most  characteristic 
of  the  criterion  normals  is  not  only  the 
statistically  unusual  one  but  is  the  one 
which  on  face  inspection  of  the  item 
would  be  considered  the  "bad"  or  "tin- 
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adjustive"  response.  Although  this  kind 
of  observation  is  of  less  value  than  is 
sometimes  believed  in  tests  of  this  soil, 
nevertheless  the  very  marked  tendency 
for  the  criterion  normals  to  say  "bad" 
things  about  themselves  more  than  the 
matched  abnormals  is  of  considerable  in- 
terest. 

A  scoring  key  was  next  constructed  in 
which  the  direction  indicated  above  as 
characterizing  the  criterion  normals  more 
than  the  criterion  abnormals  was  the 
scored  response.  Thus,  if  a  subject  sorts 
card  A- 15  into  the  "False"  category  he 
receives  one  raw  score  point  for  N  ("nor- 


mality"); while  if  he  sorts  card  A-30  into 
the  "True"  category  he  receives  one  raw 
score  point.  High  raw  scores  on  N,  there- 
fore, indicate  "normality"  whereas  low 
raw  scores  indicate  "abnormality,"  at 
least  as  far  as  persons  having  deviant  pro- 
files are  concerned.  In  what  follows,  the 
raw  N-scorc  of  a  person,  therefore,  means 
the  number  of  responses  he  made  to  the 
N  items  which  agree  with  the  response 
indicated  by  the  letters  T  and  F  in  the 
foregoing  list — the  number  of  times  he 
responded  in  the  direction  preferred  by 
the  criterion  normals. 


.  , 


Chapter  IV 
Reliability  and  Differentiating  Power  of  the  N-Scale 


RELIABILITY 

I  OR  50  normal  males  whose  mean  and 
SD  were  practically  identical  with 
that  of  the  sample  of  "men  in  general" 
on  which  T  scores  were  subsequently 
based,  the  odd-even  reliability  of  N  was 
.81,  which  indicates  a  fair  degree  of  in- 
ternal consistency  of  the  items.  For  50 
normal  females  of  high  school  age  re- 
tested  at  intervals  varying  from  4  to  14 
months,  the  test-retest  reliability  was  .74, 
which  is  somewhat  lower  than  the  typical 
test-retest  reliabilities  reported  thus  far 
for  MMPI  scales,  although  these  previous 
reliabilities  were  based  upon  a  different 
sample. 

In  the  case  of  personality  measurement 
the  desirability  of  high  test-retest  re- 
liabilities is  dependent  in  part  upon 
whether  "function  variability"  itself  is 
great  or  small.  If  the  hypothetical  "con- 
trol" factor  of  Rosanoff  were  involved, 
we  should  require  any  alleged  measure 
of  it  to  show  a  fairly  good  retest  stability. 
Since  we  do  not  know  the  psychological 
nature  of  what  is  measured  by  N,  it  is 
not  possible  to  say  to  w'hat  extent  the  co- 
efficient of  .74  is  lowered  by  test  "unreli- 
ability" and  to  what  extent  it  would  have 
to  be  lowered  to  some  degree  below  unity 
by  a  function  fluctuation  which  if  not 
detected  by  the  scale  would  be  indicative 
of  the  latter's  invalidity. 

In  the  case  of  split-half  reliability  the 
same  problem  presents  itself  as  was  pre- 
viously referred  to  in  connection  with  the 
dynamic  inhomogencity  of  the  items  on 
the  Hy  scale.  Further  "purification"  of 
N  will  have  to  be  undertaken  for  other 
reasons,  but  not  simply  to  increase  the  in- 
ternal consistency  which  may  or  may  not 


be  valued,  depending  upon  the  practical 
situation. 

DIFFERENTIATION    OF   THE    CRITERION 
GROUPS 

When  the  scoring  key  for  N  had  been 
constructed  the  first  step  was  to  score  the 
criterion  cases  on  the  whole  key  to  de- 
termine whether  or  not  the  scale  was  able 
to  differentiate  them  sufficiently  well  to 
justify  applying  it  to  other  test  groups. 
That  the  differentiation  of  the  criterion 
groups  should  be  good  is  only  a  necessary 
and  by  no  means  a  sufficient  condition  for 
accepting  such  a  scale,  since  the  most  im- 
portant test  lies  in  its  application  to 
"test"  cases,  other  than  those  cases  upon 
which  the  item  analysis  was  made. 

The  distribution  of  scores  for  the  cri- 
terion normals  and  criterion  abnormals 
is  discussed  in  the  next  section.  It  will 
be  noted  that  the  separation,  while  by 
no  means  perfect  or  as  clean  as  could  be 
desired,  is  certainly  significant  and  the 
overlap  not  excessively  great.  It  is  neces- 
sary to  avoid  undue  leniency  toward  the 
instrument  being  studied  by  attempting 
to  "explain"  all  exceptions  to  the  rule 
desired,  so  that  the  overlap  that  exists 
must  be  taken  at  its  face  value  in  evalu- 
ating the  separation.  Nevertheless,  it 
should  at  least  be  noted  that  it  would  be 
of  considerable  interest  to  have  informa- 
tion regarding  the  adjustment  of  those 
persons  who  constitute  the  low  tail  end 
of  the  distribution  of  criterion  normals. 
For  this  reason  it  is  perhaps  not  out  of 
place  to  refer  to  the  meager  data  avail- 
able on  the  information  sheets  attached 
to  their  records,  from  which  we  learn 
that   the   three   male   criterion   normals 
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Table  3 

Comparisons  of  Means  and  N'ariances  by  Sex  and  Psychiatric  Status 

A.  Tests  for  homogeneit\-  of  variance  by  sex  and  psychiatric  status 


N 


d.f. 


Sex  group: 
Male  normals 

Female  normals 

Sex  group: 

Male  abnormals 

Female  abnormals 

Criterion  group: 
42  normals 

42  abnormals 


42  criterion 

normals 

17 

16 

120.6518 

25 

24 

100.0833 

43  criterion  abnormals 

17 

16 

68.7330 

25 

24 

98.8433 

42 

41 

108.5271 

42 

41 

89.9891 

1 .206 


1-437 


1 .206 


Mean 


SD 


Diff 


tdiff 


Normals: 

1 7  male  normals 

42 

1765 

10 

656 

25  female  normals 

45 

4000 

9 

802 

Abnormals: 

17  male  abnormals 

22 

8824 

8 

043 

25  female  abnormals 

27 

5200 

9 

741 

3-2235 


4.6376 


•985 


1 .627 


P>.o5 


P>.o5 


P>.o5 


B.  Significance  of  difference  of  means  of  males  and  females 


.30<P<.4o 


[0<P<.20 


having  the  lowest  raw  scores  on  this  dis- 
tribution (20,  21,  and  27)  were  all  un- 
employed at  time  of  testing.  Of  the  three 
female  criterion  normals  showing  raw 
scores  of  26,  one  is  a  college  student  (the 
significance  of  which  will  be  elaborated 
subsequently),  another  is  unmarried  at 
age  30  and  has  a  deviant  church  affilia- 
tion. All  three  of  these  female  normals 
consider  themselves  "nervous"  according 
to  the  record  sheet. 

More  precise  statistical  tests  of  the  sig- 


nificance of  difference  between  the  groups 
show  that  the  variances  of  the  males  and 
females  do  not  differ  significantly,  nor 
do  their  means,  within  either  the  normal 
or  abnormal  group.  Nor  does  the  vari- 
ance of  the  normals  differ  significantly 
from  that  of  the  abnormals.  (Summary 
of  data  in  Table  3.) 

Consequently  the  sexes  were  pooled 
and  the  t-test  applied  to  test  the  signifi- 
cance of  difference  between  means  of 
the  normals  and  abnormals.  The  mean 


Table  4 

Comparison  of  Means  on  N  Scale  for  the  42  Criterion  Normals 

versus  the  42  Criterion  Abnormals 


Group 

N 

Mean 

SD 

Meandiff 

C.R. 

P 

Criterion  normals 
Criterion  abnormals 

42 
42 

44.0952 
25.6429 

10.293 
9-373 

18.4523 

8.487 

P<.ooi 
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score  of  the  42  normals  was  44.10,  with 
a  SD  of  10.293;  the  mean  score  of  the  42 
abnormals  was  25.64,  with  a  SD  of  9-373. 
The  test  of  significance  between  these  two 
means  shows  t  =  8.48,  which,  with  80 
degrees  of  freedom,  is  clearly  significant 
(P  <  .001).  Data  for  these  various  com- 
parisons are  shown  in  Table  4. 

Inasmuch  as  the  difference  is  taken  be- 
tween matched  groups,  a  more  appropri- 
ate test  of  significance  is  the  t  based  upon 
differences  between  matched  members  of 
each  pair.  When  this  analysis  is  made, 
we  find  that  the  best  estimate  of  the 
SEjiff  is  1.5167  Avhich  yields  fj  of  12.42. 
Of  the  42  pairs  involved,  in  only  one  case 
did  a  given  normal  have  a  lower  N-score 
than  the  criterion  abnormal  with  whom 
he  was  matched.  This  was  a  female  col- 
lege student  whose  case  was  mentioned 
above  in  regard  to  overlap. 

The  median  of  all  42  normals  is  46, 
which  is  exceeded  by  one  case  from  the 
abnormals.  Considering  the  sexes  sepa- 
rately, no  male  abnormal  reaches  or  ex- 
ceeds the  median  of  the  male  normals; 
and  one  female  abnormal  reaches  or  ex- 
ceeds the  median  of  the  female  normals. 
The  inference  from  these  findings  was 
that  the  N-scale  had  sufficient  differen- 
tiating power  on  the  criterion  groups  to 
warrant  further  study  on  other  groups. 

DISTRIBUTION     OF     N     FOR     UNSELECTED 

NORMALS   AND  TSCORES   BASED 

UPON   THIS 

It  was  necessary  to  make  a  preliminary 
standardization  in  terms  of  T-scores  even 
though  it  was  realized  that  the  scale 
would  not  be  used  in  its  first  form.  This 
latter  was  recjuired  for  convenient  inter- 
pretation when  it  was  found  that  al- 
though the  sex  differences  are  not  signifi- 
cant in  the  criterion  groups  (although, 
it  will  be  noted,  in  the  same  direction 


among  both  normals  and  abnormals) 
there  were  significant  differences  between 
the  means  of  men  and  women  sampled 
at  random  from  the  general  population 
of  "normals." 

Inasmuch  as  the  T-scores  to  be  derived 
were  merely  for  convenience  in  the  pres- 
ent study  and  would  not  be  used  in  prac- 
tical work,  it  was  sufficient  to  base  the 
standardization  on  a  smaller  number  of 
cases  than  usually  is  employed — in  this 
case  upon  100  males  and  100  females. 
The  standard  error  of  the  mean  given 
by  100  cases  is  only  about  1  point  if  we 
assume  the  SD  obtained  to  be  reasonably 
accurate  at  around  10  raw  score  points. 
Accordingly,  100  cases  were  selected  at 
random  from  the  "normal"  files,  taking 
30  cases  from  each  age  group,  16-25,  26- 
35,  36-45,  and  the  remaining  10  cases 
from  the  age  group  46-55.  Cases  with 
L  >  70,  "?"  >  60,  and  F  ^  60  were  ex- 
cluded as  of  doubtful  validity;  otherwise 
the  first  consecutive  30  cases  in  each  age 
group  were  taken  from  files. 

For  the  100  normal  males,  the  mean  N- 
score  was  29.14  with  a  SD  of  955;  for  the 
100  normal  females  the  mean  was  35.59 
and  the  SD  9.80.  This  difference  is  in  the 
same  direction  as  that  found  in  the  cri- 
terion cases,  although  in  this  case  the  dif- 
ference is  based  upon  sufficient  cases  to 
reach  statistical  significance.  The  vari- 
ances do  not  differ  significantly  as  seen 
in  Table  5,  but  the  means  are  significant- 
ly different. 

It  should  be  pointed  out  that  the  fore- 
going difference  is  not  only  statistically 
stable  but  is  of  such  magnitude  as  to  be 
of  practical  importance  and  warranting 
separate  norms;  since  a  difference  of  six 
raw  score  points  is  6/10  of  a  standard 
deviation  for  both  groups.  Since  high  T- 
scores  on  the  MMPI  indicate  abnormality 
(in  the  sense  that  higher  scores  increase 


2  6 


PAUL    E.    MEEHL 


Tablk  5 

Test  of  Significance  of  Difference  of  Means  and  Variances  for 
loo  Normal  Males  and  loo  Normal  Females 


Group 

N 

Mean 

V 

F 

Meanaiff 

C.R. 

P 

Males 
Females 

lOO 
ICO 

29.14 
35-59 

9-55 

9 .  So 

I .026* 

6.45 

4.712 

P<  .001 

*  The  F  of  1.026  is  not  significant  at  the  5  per  cent  level,  so  the  variances  are  considered  not  to 
differ. 


the  ockls  that  the  person  is  or  will  be 
psychiatrically  ill)  this  custom  has  been 
preserved  in  the  present  scale,  although 
there  are  aromncnts  on  both  sides  as  to 
its  advisability.  The  relation  of  raw 
scores  to  T-scorcs  is,  therefore,  an  inverse 
one,  contrary  to  that  which  obtains  in 
all  the  other  MMPI  scales.  U  a  person 
showing  elevated  scores  on  the  person- 
ality components  has  a  lozv  raw  score,  he 
responds  to  the  N  items  in  the  direction 
more  characteristic  of  the  criterion  ab- 
normal group;  and  this  indicates  a  great- 
er probability  of  his  being  clinically 
abnormal  than  if  he  had  responded 
more  freqtiently  in  the  scored  direction 
on  N.  Hence  such  a  low  raw  score  is 
given  a  high  T  score  in  the  table.  If  a 
profile  shows  an  elevation  on  any  of  the 
personality  components,  the  higher  the 
T-score  is,  the  greater  is  our  suspicion  of 
an  actual,  behavioral  deviation  in  respect 
to  the  syndrome  indicated  by  the  par- 
ticular scale  on  which  the  elevation  oc- 
curs. Whether  high  T-scores  on  N  (i.e., 
low  raw  scores)  indicate  a  likelihood  of 
abnormality  in  the  absence  of  any  signifi- 
cant elevation  on  the  personality  com- 
ponents proper  remains  to  be  seen;  that 
is,  an  elevated  N-component  by  itself  is 
not  prima  facie  suggestive  of  abnormal- 
ity, as  is  obvious  from  the  method  of 
scale  derivation. 

This  procedtire  has  what  may  appear 
to  be  an  unfortunate  consequence  in  that 
with  a  mean  of  around  30  to  35  and  a  SD 


of  10,  the  upper  limit  of  possible  T  scores 
is  fixed  by  the  fact  that  no  raw  score  is 
negative,  so  that  no  one  can  get  a  T  score 
above  80.  This  fact  is  comparable  to  the 
lower  limit  of  possible  T  scores  that  ex- 
ists on  the  MMPI  scales,  and  is  related  to 
the  nature  of  the  items  which  yields  a 
marked  positive  skewness  on  these  other 
scales.  It  is  possible  that  with  further  re- 
finement it  will  be  fotmd  advisable  to 
construct  a  set  of  T-scorcs  based  upon 
the  distribtuion  of  normal  persons  with 
abnormal  profiles,  rather  than  tipon  the 
general  unselected  normal  population. 
This  is  especially  likely  in  view  of  the 
fact  that  the  scale  seems  to  have  little 
significance  by  itself  as  regards  the  differ- 
entiation of  normals  from  abnormals,  in 
which  case  the  only  rational  application 
of  it  would  be  to  the  group  (both  nor- 
mals and  abnormals)  who  score  high  on 
the  personality  Components.  If  this  were 
done  one  would  not  bother  scoring  N  on 
any  profiles  except  those  showing  a  sus- 
picious elevation  on  one  of  the  eight  per- 
sonality components,  and  the  T  score 
would  specify  the  position  of  such  a  per- 
son on  the  distribution  of  all  "normals" 
with  such  deviant  profiles. 

In  the  case  of  both  males  and  females, 
the  unselected  normals  achieve  a  mean 
score  about  half  way  between  the  means 
of  the  normal  and  abnormal  criterion 
group  means  for  their  respective  sexes. 
The  relationships  are  shown  in  Table  6. 

In  the  case  of  the  males,  the  criterion 
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Table  6 

Mean  T-Scores  of  Unselected  Normals,  Criterion 

Normals  and  Abnormals,  Both  Sexes 


Group 


Mean  T-Score 
on  N  Scale 


Male  unselected  normals 
Female  unselected  normals 
Male  criterion  normals 
Male  criterion  abnormals 
Female  criterion  normals 
Female  criterion  abnormals 


5° 
50 

37 
56 
40 

58 


abnormals  average  about  i.Q  sigma 
"above"  their  matched  normals,  in  terms 
of  the  variability  of  the  unselected  male 
population;  whereas  the  female  criterion 
abnormals  average  about  l8  sigma 
"above"  their  matched  normals,  in  terms 
of  the  variability  of  the  unselected  fe- 
male population. 

Combining  the  sexes  after  converting 
to  T-scores,  the  following  table  of  cumu- 
lative frequency  for  the  entire  group  of 
criterion  normals  shows  their  overlap 
with  the  imselected  normal  population 
(Table  7). 

Table  7 

Distribution  of  T- Scores  for  42 
Criterion  Normals 


Cumu- 

Cumu- 

T-Score 

Fre- 
quency 

Per 

Cent 

lative 
Fre- 
quency 

lative 

Per 

Cent 

I0<T<20 

I 

2 

I 

2 

2o<T^30 

8 

19 

9 

21 

30<T^4o 

20 

48 

29 

69 

4o<T<5o 

6 

14 

35 

83 

50<T^6o 

7 

17 

42 

100 

No  case  among  the  criterion  normals 
obtained  a  T  of  over  60  on  the  N-Scale. 
The  median  T  score  for  the  entire  group 
is  33-5,  sexes  pooled,  and  83  per  cent  of 
the  cases  do  not  exceed  the  mean  of  the 
unselected  normals. 

As  regards  the  criterion  abnormals, 
Table  8  gives  the  comparable  percentages 
for  them.  No  case  among  the  criterion 
abnormals  obtained  a  T  as  small  as  30 


on  the  N-scale,  and  only  one  obtained  a 
T  as  small  as  40,  whereas  69  per  cent  of 
the  normals  fell  at  T  ^  40.  The  median 
T  of  the  entire  42  criterion  abnormals 
was  59. 

Table  8 

Distribution  of  T-Scores  for  42 
Criterion  Abnormals 


Cumu- 

Cumu- 

T-Score 

Fre- 
quency 

Per 
Cent 

lative 
Fre- 
quency 

lative 

Per 

Cent 

Tg20 

0 

0 

42 

100 

2o<Tg3o 

0 

0 

42 

100 

30<T^4o 

I 

2 

42 

100 

40<T^5o 

9 

21 

41 

98 

50<T<6o 

14 

33 

32 

76 

6o<T<70 

16 

38 

18 

43 

7o<T<8o 

2 

5 

2 

5 

DEVIATION    OF    THE     "COLLEGE    LEVEL" 
GROUP    ON    THE    N    SCALE 

In  all  of  the  work  with  MMPI  it  has 
been  found  necessary  to  take  special  pre- 
cautions regarding  the  college  popula- 
tion and  college-educated  persons.  The 
extent  to  which  these  persons  deviate  in 
their  responses  to  many  items  from  the 
general  population  is  sufficiently  great 
to  necessitate  elimination  of  items  which 
on  other  grounds  appear  to  be  good  dif- 
ferentiators for  the  variable  being  stu- 
died, but  also  unfortunately  "differen- 
tiate" college  people  from  people  in 
general.  In  spite  of  the  care  exercised  in 
this  respect  in  construction  of  the  per- 
sonality scales  of  the  MMPI,  it  is  still 
observed  that  college  means  differ  some- 
what from  those  of  persons  in  general. 
The  typical  college  student,  for  example, 
shows  a  deviation  below  the  mean  on  Hs 
and  D,  whereas  he  tends  to  show  an  ele- 
vation on  Hy.  It  was  suspected  that  the 
college  (and  apparently  "college-edu- 
cated") group  might  show  considerable 
differences  from  people  in  general  on 
the  N  scale,  particularly  when  the  appli- 
cation of  N  to  some  of  those  cases  which 
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originally  aroused  interest  in  the  possi- 
bility of  such  a  scale  failed  to  reveal  as 
high  scores  on  N  as  would  be  suggested 
by  their  being  clinically  normal  in  spite 
of  having  deviant  profiles.  Accoidingly, 
males  and  females  of  the  college  popula- 
tion were  sampled  in  an  effort  to  deter- 
mine the  maonitude  of  this  difference. 

For  a  sampling  of  50  college  males  the 
mean  raw  score  on  N  was  21.40  which 
corresponds  to  a  T  score  of  59,  not  quite 
a  full  standard  deviation  above  the  mean 
for  the  general  male  population.  The  SD 
oi  these  50  male  college  students  was 
8.938,  as  compared  to  9.55  for  the  gen- 
eral unselected  male  normals.  The  dif- 
ference between  these  means  is  signifi- 
cant (t  =  4.746  P  <  -01)  but  the  differ- 
ence bctAvcen  the  standard  dc\iations  is 
not. 

For  a  similar  sample  of  100  college  fe- 
males (from  ^vhich,  however,  profiles 
showing  any  T  ^  70  except  Mf  has  been 
excluded  for  another  study)  the  mean 
raw  score  on  N  was  25.93,  which  corre- 
sponds to  a  T  score  of  about  60,  again 
about  one  SD  abo\e  the  mean  of  the  gen- 
eral female  normal  population.  If  the 
cases  with  "abnormal"  profiles  are  in- 
cluded, the  mean  rises  slightly  to  27.16, 
corresponding  to  a  T  score  of  59.  The 
SD  of  the  entire  gioup  of  122  female  col- 
lege students  is  8.079  which  in  this  case 
does  differ  significantly  (P  <  .05)  from 
that  of  the  general  female  normal  popu- 
lation. 

The  explanation  of  this  rather  pro- 
nounced deviation  from  the  mean  of  the 
general  population  is  not  clear.  The 
fact  of  this  deviation,  however,  is  of 
great  importance  for  two  reasons.  First, 
it  has  a  bearing  upon  the  interpretation 
of  the  N  scale,  of  such  a  nature  as  to 
dovetail  with  other  evidence  in  casting 
serious   doubt  upon  any   interpretation 


of  the  scale  in  terms  of  Rosanoff's  "nor- 
mal" component.  Secondly,  it  indicates 
either  that  the  scale  cannot  be  applied 
to  college  students  or  else  that  if  so  ap- 
plied it  must  be  interpreted  using  a  spe- 
cial set  of  norms. 

Tlie  frequency  of  college  graduates 
among  those  test  and  criterion  cases 
which  seemed  to  be  test  "misses"  (i.e., 
were  out  of  the  hospital  in  spite  of  ele- 
vated MMPI  profiles  but  also  showed  an 
elevated  T  on  the  N  scale)  suggested  that 
having  been  a  college  student  might  alter 
the  likelihood  of  a  high  score  even  though 
the  person  was  not  still  in  college  at  time 
of  testing.  This  suspicion  was  strength- 
ened by  the  fact  that  a  group  of  college 
graduates  studied  with  the  MMPI  in  an 
industrial  concern  showed  the  usual  col- 
lege pattern  even  though  the  age  range 
was  considerable. 

To  test  this  hypothesis  there  were  se- 
lected at  random  from  a  sample  of  pro- 
files obtained  in  this  industrial  concern 
the  profiles  of  50  college  graduates  (engi- 
neering) of  age  25  or  over,  disregarding 
the  profile  itself.  The  mean  age  of  this 
group  was  31.64  years,  as  compared  with 
30.95  for  the  100  unselected  male  nor- 
mals from  the  general  population  with 
whom  they  are  to  be  compared.  The  age 
range  was  from  25  to  46,  with  a  SD  of 
5.102  years.  We,  therefore,  have  a  group 
of  males  who  are  all  college  graduates 
and  have  been  out  of  college  an  average 
of  8  years,  and  whose  mean  age  is  very 
close  to  that  of  the  general  male  normal 
sample  used  in  getting  the  tables  for  T. 

For  this  sample  of  50  male  college 
graduates  the  mean  N  score  (raw)  was 
20.9,  which  corresponds  to  a  T  score  of 
59,  which  is  exactly  that  of  the  sample  of 
college  males  still  in  school  at  time  of 
testing.  The  SD  of  raw  scores  was  6.688, 
which  differs  significantly  (P  <  .01)  from 
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the  variability  of  the  100  unsclccted  male  nosis.  That  a  T  of  60  may  mean  as  much 
normals  from  the  general  population,  on  in  terms  of  its  statistical  rarity  among 
which  the  T  scores  were  based.  The  mean  college  sophomores  as  one  of  70  in  the 
raw  score  of  20.9  differs  significantly  general  population  is  admitted  and  is 
from  the  mean  29.14  of  the  unselected  quite  trivial.  The  important  thing  to 
general  population  males,  showing  a  t  determine  is  whether  in  the  long  run  one 
of  3.10  (P  <  .01).  Unfortunately,  a  corre-  can  expect  a  college  student  scoring  60 
sponding  study  for  females  out  of  col-  on,  say,  Hs  to  have  as  many  bodily  com- 
lege  for  some  years  at  the  time  of  testing  plaints  and  be  as  refractory  to  modifica- 
could  not  be  made  due  to  lack  of  con-  tion  as  a  non-college  person  with  a  score 
venient  case  material.  Considering  the  of  70.  It  must  not  be  forgotten  that  scales 
similar  elevation  of  female  college  stu-  such  as  those  of  the  MMPI  acquire  what- 
dents,  it  is  provisionally  taken  as  prob-  ever  non-statistical  meaning  they  possess 
able  that  some  degree  of  elevation  at  from  the  clinical  description  of  those  ex- 
least  would  be  maintained  in  later  life,  treme  deviates  who  make  up  the  diag- 
although  this  will  be  investigated  when  nostic  categories  for  which  the  various 
a  suitable  group  of  profiles  is  readily  scales  are  named.  "Hypochondriasis"  re- 
available  for  study.  fers  initially  to  a  certain  behavior  syn- 

It  appears  from  this  evidence  that  the  drome  found  in  a  small  number  of  neu- 
elevated  T  scores  of  college  persons  are  rotic  persons;  the  burden  of  proof  is 
not  a  function  wholly  of  their  age,  nor  upon  anyone  who  assigns  any  other  than 
of  being  "in  college"  at  the  time  of  test-  a  somewhat  "watered"  meaning  of  this 
ing.  The  question  then  arises,  is  it  legiti-  term  in  describing  milder  deviates  scor- 
mate  and  desirable  to  employ  separate  ing  in  the  "normal"  although  elevated 
norms  in  the  application  of  N  to  the  range.  If  it  is  actually  the  case  that  col- 
college  group  (henceforth  the  "college  lege  students  and  persons  who  have  grad- 
group"  will  be  used  to  designate  college  uated  from  college  exhibit  significantly 
students  and  college  graduates  alike)  or  less  body-consciousness  and  somatic  com- 
should  the  scale  not  be  applied  to  them  plaining  than  the  general  population, 
at  all?  there  would  seem  to  be  no  good  reason 

If  we  had  some  way  of  knowing  ivhy  for  applying  a  special  set  of  norms  to 

the  college  group  deviated  from  the  gen-  college    people    based    upon    their   own 

eral  population,  we  would  know  how  to  group. 

answer  this  question.  But  since  the  rea-  In  the  case  of  the  present  scale  this 

son  for  the  deviation  is  not  known,  it  is  reasoning  is  not  so  straight-forward.  The 

difficult  to  decide.  The  proposal  has  been  function  of  the  N  scale  is  not  to  measure 

made  by  certain  college  counselors  that  a  "personality"  component  which  is  of 

a  T  score  of  60  on  one  of  the  personality  intrinsic  interest  to  us  (although  it  may 

components  "means  as  much  for  a  col-  ultimately  turn  out  to  do  so),  but  rather 

lege  sophomore  as  a  T  of  70  would  mean  to  distinguish   persons  showing  deviant 

for  a  person  from  the  non-college  gen-  profiles  but  psychiatrically  adjusted  from 

eral  poulation."  The  rationale  of  this  at-  persons  showing  equally  deviant  profiles 

titude   is  questioned  by  those  who  are  who     are     psychiatrically     maladjusted, 

most  intimately  connected  with  the  ap-  Since   we   do   not   know   how   the  scale 

plication  of  the  MMPI  in  clinical  diag-  works    (i.e.,    the   psychodynamics   of   re- 
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sponding  to  items  which  bring  about 
such  discrimination  as  empirically  oc- 
curs) it  is  not  clear  whether  separate 
norms  would  facilitate  this  in  the  case  of 
college  students  or  not.  Actually  it  would 
be  necessary  to  make  a  separate  "test" 
study  of  college  level  abnormals  and  nor- 
mals to  see  whether  college  normals  with 
deviant  profiles  score  relatively  higher 
raw  scores  on  N  than  abnormals,  in 
which  case  the  use  of  separate  college 
norms  ^\ould  be  profitable  in  practice. 
This  has  not  been  done,  mainly  because 
of  the  great  difficulty  of  selecting  cases 
from  the  abnormal  files  on  the  basis  of 
level  of  educational  attainment. 

Nevertheless,  it  is  at  least  suggestive 
to  consider  the  performance  of  the  col- 
lege "normals"  %vho  deviate  on  the  other 
personality  components  of  the  MMPI 
in  terms  of  their  N  scores  as  related  to 
college  group  means  and  variabilities. 
For  22  college  girls  showing  any  T  score 
of  70  or  greater  (excluding  Mf)  but 
"normal"  so  far  as  known,  the  median 
raw  score  on  N  is  32,  which  is  at  about 
the  86th  percentile  for  a  sample  of  100 
female  college  students,  and  the  mean 
raw  score  is  32.73  which  is  approximately 
one  SD  above  the  female  college  mean. 
Such  a  score,  on  the  other  hand,  would 
be  about  one-third  of  a  SD  below  the 
mean  raw  score   of  women  in  general 

(T  -  53). 

Among  a  group  of  over  100  employed 

male  college  giaduates  there  were  found 
9  persons  who  showed  a  T  score  of  70 
or  greater  on  any  of  the  eight  MMPI 
personality  scales  excluding  Mf.  These 
persons,  while  probably  not  so  well  ad- 
justed as  the  non-deviant  cases,  were  at 
least  "normal"  enough  to  make  an  ade- 
quate occupational  adjustment  at  a  fair- 
ly high  level  and  stay  out  of  the  hands  of 
a  psychiatrist.  Their  mean  raw  score  on 


N  was  28.67  which  corresponds  to  a  T 
of  50  for  the  general  population.  How- 
ever, in  terms  of  the  distribution  of  N 
scores  for  the  entire  group  of  employed 
college  graduate  men  in  which  they  oc- 
curred, this  mean  is  over  1.1  SD  above 
the  latter's  mean,  and  would  be  equiva- 
lent to  a  T  score  of  about  39  if  separate 
norms  were  used.  This  latter  T  score 
is  not  far  from  the  T  of  37  found  in  the 
original  criterion  group  of  male  normals 
showing  abnormal  profiles. 

Further  evidence  regarding  the  de- 
sirability of  separate  norms  for  college 
persons  will  have  to  be  collected.  Until 
this  evidence  is  forthcoming,  however, 
the  same  norms  will  be  used  in  order  not 
to  favor  the  scale  unduly;  however,  it 
will  at  least  be  pointed  out  in  what  fol- 
lows that  in  several  cases  "misses"  are 
not  clearly  "misses"  because  the  cases  in 
question  would  not  be  misses  had  the  T 
scores  on  N  been  used  been  based  upon 
special  college  norms. 

APPLICATION    OF    N-SCALE   TO    A    "tEST" 

GROUP    OF    NORMALS   WITH    ELEVATED 

NEUROTIC  TRIAD  SCORES 

As  a  further  study  of  the  differentiat- 
ing power  of  N  it  was  applied  to  a  new 
"test"  group  of  deviant-profiled  normals 
from  two  sources — a  file  from  a  group  of 
WPA  workers  and  a  set  of  scores  con- 
tributed by  a  medical  detachment  at  Fort 
Snelling  Hospital.  While  the  WPA  group 
might  be  thought  atypical  to  such  an  ex- 
tent as  to  be  unusable  for  such  pur- 
poses, the  findings  on  the  other  MMPI 
scales  and  the  item  frequencies  for  this 
group  do  not  indicate  that  they  differ 
in  any  appreciable  degree  from  the  gen- 
eral population  in  those  aspects  of  per- 
sonality sampled  by  the  inventory. 

Since  the  derivation  of  N  was  based 
upon  criterion  cases  showing  elevations 
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on  the  neurotic  triad  Hs,  D,  and  Hy, 
the  ideal  test  cases  should  be  those  show- 
ing these  elevations.  Only  a  small  num- 
ber of  such  cases  was  found  in  the  WPA 
and  army  gioups  combined,  namely,  a 
sample  of  22  male  individuals  who  were 
making  an  adequate  adjustment  psychi- 
atrically  as  far  as  was  known,  but  who  de- 
viated to  the  extent  of  T  ^  70  on  Hs, 
D,  or  Hy.  Six  cases  were  included  which 
probably  shoidd  have  been  left  out  on 
the  basis  of  information  subsequently 
collected;  three  of  these  were  found  to 
have  been  "under  a  doctor's  care"  (com- 
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As  may  be  seen  from  an  inspection  of 
the  distribution,  there  is  a  gap  from  43 
to  50  above  which  lie  seven  cases,  of 
which  five  belong  to  the  six  cases  men- 
tioned above  as  possibly  deserving  of  ex- 
clusion. The  one  case  with  T  =  64  is  a 
college  graduate  and  was  under  a  doc- 
tor's care  when  tested.  The  case  at  T  = 
56  and  that  at  T  ==  50  are  both  college 
graduates.  The  cases  at  T  =  52  and 
T  =  55  were  both  "under  a  doctor's 
care"  at  time  of  testing.  Two  of  these 
three  college  graduates  are  indicated  on 
the  attached  record  as  "nervous,"   and 
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Fig.  2.  Distribution  of  T-scores  for  22  male  "test"  normals  with  elevated  neurotic  triad  but  out  of 
the  hospital;  compared  with  scores  for  27  test  abnormals  in  the  hospital  with  diagnosed  psychoneurosis 
and  elevated  triad. 


plaint  unknown)  at  the  time  of  testing 
(a  fact  which  would  exclude  them  from 
the  "normal"  category  according  to  the 
standards  on  which  the  MMPI  was  origi- 
nally constructed)  and  the  other  three 
cases  were  college  graduates. 

The  distribution  of  these  cases  is 
shown  in  Fig.  2.  The  distribution  placed 
above  it  will  be  discussed  later.  The 
median  T  score  of  these  22  "test  nor- 
mals" with  elevated  neurotic  triad  but 
out  of  the  hospital  was  40.5,  and  the 
mean  T  score  was  42,  with  a  SD  of  9.585. 
So  this  test  group  falls  at  .8  to  1  SD  be- 
low the  mean  of  the  general  population, 
as  compared  with  the  criterion  normal 
males  at  about  1.3  SD  below. 


the  one  with  T  =  56  was  unmarried  at 
age  45.  With  these  considerations  in  mind 
it  may  be  said  that  the  "test"  group  re- 
sults compare  rather  favorably  with  the 
results  of  the  original  criterion  cases. 

Had  the  six  doubtful  cases  been  ex- 
cluded to  be  more  lenient  in  judging  the 
N  scale  instead  of  rigorous,  the  median 
T  score  would  have  been  37,  at  the  mean 
of  the  original  male  criterion  normals. 
The  mean  T  score  in  that  case  would 
have  been  37.8,  again  very  close  to  that 
of  the  criterion  normals. 

Another  way  of  looking  at  these  same 
data  would  be  to  convert  the  scores  of 
the  three  college  cases  into  appropriate 
T  scores  assuming  that  separate  college 
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norms  should  be  used.  In  this  case  the 
college  cases  with  T's  of  64,  56,  and  50 
would  instead  score  T's  of  56,  48,  and  4 1 , 
respectively.  Using  these  transformed 
scores  instead  of  the  original  ones,  the 
median  T  score  of  the  entire  22  test  nor- 
mals would  be  at  40.  Only  5  cases  would 
now  have  a  T  above  the  mean  T  of  50, 
and  no  case  would  have  a  T  score  above 
55.  (See  Table  9.) 

Table  9 

Frequency    Distriliution    for    22    Male    "Test" 
Normals  from  WPA  ami  Arm\-  Medical  Detach- 
ment   Group,    Showing    Elevated    Neurotic 
Triads.  The  College  Cases  Have  Been 
Expressed  in  Terms  of 
College  Norms 


T  Score 


20^T^29 

30^'r^39 
40  ^  T  ^  49 

5o  =  T^59 
6o^T^69 


Cumu- 

Cumu 

Fre- 

lative 

Per 

lative 

quency 

Fre- 

Cent 

Per 

quency 

Cent 

3 

3 

14 

14 

7 

10 

32 

46 

7 

17 

32 

77 

5 

22 

23 

100 

0 

22 

0 

100 

It  seems  justifiable  from  the  preceding 
analysis  to  state  that  at  least  so  far  as 
the  neurotic  triad  is  concerned,  a  "nor- 
mal" person  who  shows  an  elevated  pro- 
file but  manages  to  stay  out  of  psychi- 
atric hands  will  average  a  T  score  in  the 
neighborhood  of  40  or  about  one  SD  be- 
low (one  SD  above  in  raw  score  terms) 
the  mean  of  the  normal  population.  As 
the  T  score  of  a  deviant-scoring  profile 
rises  above  50  the  chances  of  its  owner 
being  free  of  psychiatric  involvement  to 
the  extent  of  hospitalization  become  pro- 
gressively smaller;  whereas  a  T  score  as 
high  as  60  will  make  these  chances  very 
slim.  Putting  it  in  another  way,  in  terms 
of  whatever  component  the  raw  scores 
on  N  measure,  a  person  on  the  average 
needs  an  amount  of  this  component 
about  1  sigma  above  the  mean  of  the 
general  population   if  he   is   to  remain 


"normal"  in  spite  of  elevated  scores 
(T  ^  70)  on  the  neiuotic  triad.  As  one 
reaches  and  then  begins  to  fall  below 
the  general  popidation  raw  score  mean 
on  this  factor,  the  odds  increase  that  one 
is  not  a  member  of  the  "normal"  group. 
If  one  scores  above  70  on  Hs,  D,  or  Hy 
and  falls  as  much  as  one  SD  below  the 
mean  of  the  general  population,  the 
chances  that  he  is  psychiatrically  in- 
volved are  high. 

It  should  be  pointed  out  again  at  this 
time  that  the  great  crudity  of  division 
into  "normal"  and  "abnormal"  on  the 
basis  of  the  only  available  objective  cri- 
terion "intramural"  or  "extramural" 
with  reference  to  a  psychopathic  ward 
inevitably  worsens  the  apparent  difler- 
entiating  power  of  a  scale,  particularly 
one  such  as  the  present.  Although  prac- 
tically all  persons  who  are  "in"  a  mental 
hospital  with  psychosis  or  psychoneu- 
rosis  deserve  to  be  there,  the  converse  is 
not  true  for  those  who  are  "outside."  De- 
tailed study  of  the  individuals  used  in 
testing  such  a  scale  would  almost  invari- 
ably work  in  its  favor  by  finding  inde- 
pendent evidence  of  psychiatric  difficul- 
ties in  persons  otherwise  counted  as  "ex- 
ceptions" to  a  rule.  As  an  example  of 
this  I  might  mention  one  of  the  few  cases 
in  which  a  person  who  would  be  counted 
as  such  a  test  "miss"  was  accessible  to  me. 
I  was  shown  a  profile  of  a  college  student 
with  an  elevation  in  the  region  of  T  = 
90  on  the  Hy  scale,  with  Hs,  D,  and  Mf 
also  up  but  not  above  70.  I  scored  his 
test  on  the  N  scale  and  found  to  my  dis- 
comfiture (this  being  before  the  extent 
of  college  atypicality  on  N  was  realized) 
that  his  N  score  was  about  65,  contrary 
to  a  provisional  hypothesis  that  60  was 
a  rough  limit  of  T  scores  on  N  to  stay 
out  of  the  hospital  if  one  had  a  deviant 
profile    on    one    of    the    other    scales.    I 
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mentally  recorded  this  case,  which  I  had 
come  across  more  or  less  accidentally  in 
a  social  way,  as  a  "miss"  for  the  N-scale. 
Shortly  thereafter  I  had  occasion  to  meet 
the  individual  in  question  personally, 
without  his  knowing  that  I  had  seen  his 
MMPI  profde.  In  the  course  of  conver- 
sation I  inquired  as  to  his  draft  status. 
He  informed  me  that  he  had  been  in  the 
army  but  had  been  discharged  for  "func- 
tional asthma."  I  asked  whether  he  had 
any  other  ailments,  and  he  went  on  to 
describe  the  terrible  headaches  he  had  so 
frequently  for  which  the  doctors  could 
find  no  explanation,  the  backaches  which 
were  equally  troublesome,  and  the  fact 
that  he  had  had  insomnia  and  was  quite 
a  "nervous  person."  Had  this  opportunity 
for  direct  clinical  contact  not  presented 
itself,  this  case  w'ould  have  appeared 
merely  as  a  misplaced  X  on  a  freqviency 
distribution  and  called  a  "miss"  for  the 
scale. 

In  the  evaluation  of  the  results  apply- 
ing N  to  the  "test"  group  of  normal 
deviates  on  the  neurotic  triad,  one  needs 
to  consider  the  fact  that  in  the  original 
criterion  group  of  abnormals,  the  pro- 
files were  in  a  sense  "artificially"  held 
down  by  the  nature  of  the  matching 
technique  involved.  That  is,  due  to 
the  rarity  of  extremely  elevated  T 
scores  among  normals,  the  matched 
profiles  of  the  criteron  abnormals  were 
simply  not  allowed  in  the  matching 
process  to  attain  the  gieat  elevations 
(T  =  90  and  up)  which  do  occur  in  the 
hospital  neurotic  population.  In  a  cer- 
tain sense  this  is  of  no  consequence, 
inasmuch  as  if  it  is  established  that  T 
scores  above  this  upper  limit  almost 
literally  never  occur  in  a  valid  normal 
profile  the  problem  of  distinguishing 
"normals"  with  such  elevations  does  not 
exist.   That   is,   that  a  scale  such  as  N 


should  "work"  only  within  the  semi- 
pathological  range,  say,  of  T  =  70  to 
T  =  90  (which  includes  two  full  stand- 
ard deviations)  would  be  no  cause  for 
complaint  if  everyone  scoring  above  90 
were  practically  certain  to  be  pathologi- 
cal. It  might  be  objected  that  the  normal 
curve  integral  does  not  decrease  very 
rapidly  between  a  score  at  two  sigma  and 
a  score  at  four  sigma.  This  statement  has 
a  superficial  plausibility,  but  it  is  based 
upon  the  erroneous  assumption  that  the 
"probabilities"  in  question  are  those  of 
a  normal  curve  of  distribution  for  the 
"normal"  population.  The  point  must 
be  clear  that  the  "probability  of  a  man's 
being  correctly  called  abnormal  from 
his  profile  alone"  is  not  a  function  of  the 
normal  distribution  alone  but  depends 
to  an  equal  degree  upon  the  distribution 
of  scores  for  abnormal  persons  and  de- 
pends most  of  all  upon  the  actual  rela- 
tive frequency  of  normals  and  abnormals 
as  defined  by  the  non-test  criterion.  The 
solution  of  this  problem,  which  is  really 
one  of  inverse  probability,  requires  the 
application  of  Bayes'  Rule  and  as  usual 
the  relevant  knowledge  for  this  applica- 
tion is  inaccessible  to  us  in  practice.  In 
point  of  fact,  if  one  calls  a  case  "abnor- 
mal" (by  some  arbitrary  non-test  dividing 
criterion)  whenever  one  finds  a  T  above 
the  5  per  cent  level,  he  will  not  be  right 
95  per  cent  of  the  time  by  any  means. 
The  relative  frequency  of  clinically  nor- 
mal and  abnormal  persons  is  such  that 
the  best  guess  is  probably  "normal"  with 
any  T  score  between  70  and  80.  There 
may  only  be  5  persons  in  100  who  score 
above  70  on  Sc  among  the  normal  popu- 
lation; but  there  are  much  fewer  than 
5  schizophrenics  in  100  among  the  gen- 
eral population. 

To  illustrate  this  by  a  specific  example, 
consider  a  population  in  which  the  rela- 
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live  frequency  of  actual  schizophrenics 
(defined,  say,  by  the  presentation  upon 
careful  psychiatric  examination  of  symp- 
toms so  marked  that  the  examiner  would 
be  willing  to  attach  this  as  a  diagnosis) 
is  as  high  as  5  per  cent,  which  is,  of 
course,  much  higher  than  really  exists. 
Assume  that  it  is  as  rare  for  an  actual 
schizophrenic  to  hold  his  T  score  on 
Sc  doxL'H  to  70  as  it  is  for  a  "normal" 
person  to  achieve  a  score  as  high  as  70 
or  above,  say,  again  5  per  cent.  Then  if 
one  were  to  draw  a  sample  of  persons  at 
random  from  such  a  population  and 
label  every  case  a  schizophrenic  who 
showed  a  score  ^  70,  he  woidd  not  be 
right  by  any  means  in  95  cases  out  of 
100  but  in  about  67  out  of  too.  The 
"probability  that  a  man  scoring  as  high 
as  70  is  a  schizophrenic"  is  only  .67,  and 
1  in  3  times  such  a  diagnosis  will  be 
erroneously  applied  to  a  man  who  is  in 
fact  "normal." 

For  this  reason  it  is  quite  reasonable 
to  concern  oneself  with  the  group  of  nor- 
mals scoring  between  70  and,  say,  90, 
as  of  sufficient  frequency  to  justify  study, 
even  though  the  mathematical  properties 
of  the  normal  curve  (which  do  not  apply 
to  MMPI  scales  for  the  most  part  any- 
way) are  such  as  to  indicate  to  a  super- 
ficial examiner  the  practical  uselessness 
of  studying  the  group  in  that  range.  Out 
of  1000  "normal"  cases  in  the  files  one 
may  expect  to  find,  say,  50  profiles  show- 
ing scores  above  70  on  any  given  scale, 
and  yet  not  find  a  single  case  (except  in- 
valid profiles  with  high  F)  scoring  above 
90.  As  was  pointed  out  earlier  in  the 
introductory  remarks,  it  is  this  group  of 
50  persons  at  whom  the  present  study  is 
aimed. 

Nevertheless,  the  mean  profile  of  the 
criterion  abnormals  is  almost  completely 
within  the  range  of  two  standard  devia- 


tions usually  called  "normal,"  and,  there- 
fore, it  is  worth  while  to  study  the  N 
scores  of  a  group  of  "test"  abnormals  of 
appropriate  neiuotic  triad  diagnosis  and 
test  scores  whose  profiles  are  allowed  to 
vary  freely  into  the  higher  ranges.  The 
distribiuion  of  these  abnormals  must 
be  compared  with  that  for  the  22  test 
normals  we  have  just  finished  consider- 
ing in  terms  of  their  relation  to  the  gen- 
eral population  norms  alone. 

From  the  abnormal  file  were  selected 
at  random  27  male  abnormals  with  an 
elevated  neurotic  triad  (Hs,  D,  or  Hy  ^ 
70)  and  a  clinical  diagnosis  in  harmony 
with  such  a  configuration  in  that  the 
diagnosis  was  required  to  be  psycho- 
neurosis,  either  hypochondriasis,  hys- 
teria, reactive  depression,  or  mixed  type. 
The  group  thus  selected  turned  out  to 
consist  of  1 1  hypochondriacs,  8  hysterics, 
6  psychoneurosis  mixed,  and  2  reactive 
depressions.  Cases  with  ?  or  F  scores  over 
70  were  not  allowed.  This  group  of  "test" 
normals  is  also  stringent  in  its  require- 
ments for  the  present  scale,  because 
whereas  deviations  in  the  upward  direc- 
tion were  unrestricted  in  this  group,  all 
cases  used  as  test  abnormals  showed  ele- 
vations on  the  neurotic  triad,  whereas 
a  number  of  hospitalized  abnormals  with 
such  a  diagnosis  fail  to  do  so. 

The  mean  raw  score  on  N  for  this 
group  of  abnormals  was  26.44,  which 
corresponds  to  a  T  score  of  53.  The  SD 
of  the  raw  scores  was  7.057.  The  mean  of 
53  is  about  1/3  sigma  below  the  mean  T 
score  of  56  found  in  the  male  criterion 
abnormals,  whose  profiles  were  "held 
down"  artificially  by  the  matching  tech- 
nique; but  the  diff^erence  is  not  signifi- 
cant (t  =  1.507,  p  >  .10). 

The  distribution  for  these  27  random 
and  unrestrained  profiles  was  presented 
in  Fig.  2  for  comparison  with  the  distri- 
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bution  of  N  scores  for  the  22  male  "test" 
normals.  It  can  be  seen  that  the  overlap 
is  considerable,  6  of  the  test  normals 
reaching  or  exceeding  the  median  of  the 
abnormals.  The  separation  of  this  test 
series  should  be  compared  with  the  sepa- 
ration shown  by  the  distributions  for  the 
criterion  normals.  It  should  be  pointed 
out  again  that  the  highest  scoring  seven 
test  normals  include  three  college  gradu- 
ates one  of  whom  was  under  a  doctor's 
care  and  two  other  persons  under  a  doc- 
tor's care  for  unknown  ailments.  With 
this  in  mind  the  overlap  does  not  appear 
quite  so  great. 

In  spite  of  these  handicaps,  the  differ- 
ence between  the  means  of  these  two  dis- 
tributions is  10.37  ^^^  score  points,  which 
is  statistically  significant  (t  =  4.268,  P  < 
.01).  It  can  be  said  with  confidence  that 
even  if  the  matching  process  did  hold 
down  the  T  scores  of  the  criterion  ab- 
normals to  a  slight  degree  still  it  did 
not  hold  them  down  to  such  an  extent 
that  the  differentiation  is  removed  when 
the  profiles  are  not  so  restrained. 

A  comparable  test  study  with  females 
could  not  be  carried  out  because  of  the 
very  small  number  of  female  normals 
with  elevated  neurotic  triads  available 
other  than  those  already  used  in  the  cri- 
terion study.  There  were  found  only 
seven  female  cases  in  the  WPA  sample 
with  elevated  triads,  of  whom  one  was 
a  Negro  college  graduate  (social  worker) 
and  another  was  under  a  doctor's  care, 
considered  "nervous,"  and  separated 
from  her  husband  at  the  time  of  testing. 
This  left  only  five  cases  for  whom  both 
clinical  normality  and  applicability  of 
norms  could  be  reasonably  assumed.  The 
mean  T  score  of  all  seven  cases  is  at  50 
— the  mean  of  the  general  population. 
If  the  college  case  is  changed  to  college 
group  T-score,  the  mean  would  be  48. 


Whether  female  test  cases  would  show  a 
differentiation  better  or  worse  than  the 
males  cannot  be  predicted  upon  present 
evidence. 

APPLICATION  TO  CASES  WITH  ELEVA- 
TIONS  OTHER   THAN    ON   THE 
NEUROTIC  TRIAD 

Due  to  the  small  number  of  deviant 
profiles  available  for  study  in  the  normal 
file,  it  was  not  possible  to  apply  the  N- 
scale  to  several  new  "test"  groups  of  typi- 
cal normals  with  deviant  profiles  on 
the  neurotic  triad,  as  would  have  been 
the  best  procedure.  However,  it  will  be 
remembered  that  in  the  original  selec- 
tion of  deviant  scoring  normals,  study 
was  confined  to  those  showing  deviations 
on  the  neurotic  triad,  Hs,  D,  and  Hy. 
The  remaining  set  of  deviant  scoring  nor- 
mals constitute  a  possible  "test"  group, 
although  they  have  the  disadvantage  of 
testing  two  hypotheses  at  once.  For  if  the 
N-scale  fails  to  discriminate  them  in  the 
desired  direction,  it  would  be  unclear 
whether  this  was  because  too  many  of  the 
original  items  were  merely  selected  due 
to  sampling  errors,  or  whether  the  same 
"suppression  variables"  are  simply  not  ap- 
plicable to  the  other  scales  as  to  the  neu- 
rotic triad. 

In  order  to  get  a  sufficient  number  of 
cases  it  was  necessary  to  relax  somewhat 
the  previously  stringent  restrictions  as 
to  validity.  In  this  group,  F  scores  were 
allowed  to  rise  above  60  before  discard- 
ing a  profile  as  invalid,  although  no  case 
retained  had  an  F  over  66.  There  was 
also  an  examination  of  the  profile  as  to 
its  "form,"  which  is,  for  persons  ac- 
quainted with  the  clinical  application 
of  the  MMPI,  a  good  indicator  of 
whether  the  F  is  "validly"  high  or  an  in- 
dicator of  confusion  or  non-cooperation. 
(Of   course,    these    procedures   were   ap- 
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plied  before  any  determination  of  the  N 
score  was  made  on  the  cases.)  The  result 
of  this  screening  was  the  selection  of  21 
"normal"  males  showing  at  least  one  T 
score  equal  to  or  greater  than  70,  but 
not  on  any  of  the  three  neurotic  triad 
scales,  and  21  "normal"  females  with 
similarly  deviant  jnofilcs.  (The  total 
number  of  "test"  cases  is  42  by  pure  coin- 
cidence, and  it  must  be  clearly  under- 
stood that  none  of  these  42  test  cases  is 
among  the  42  of  the  criterion  group  used 


per  cent,  which  compares  very  favorably 
with  a  corresponding  83  per  cent  for  the 
criterion  group.  Here  also,  no  case  was 
found  with  a  T  greater  than  60  on  the 
N-scale.  The  summary  of  these  cases  is 
presented  in  Table  10. 

In  spite  of  the  fact  that  the  central 
tendencies  of  the  T  scores  on  N  rise 
when  the  scale  is  applied  to  the  new 
group,  the  relative  rarity  of  scores  above 
the  general  population  mean  of  50  is 
maintained.   This  woidd  suggest  a   fair 


Table  10 

Distribution  of  T- Scores  on  N-Scalc  for  21  Male  and  21  Female  "Test" 
Normals  Showing  Elevated  Profiles  Other  Than  on  Hs,  D,  or  Hy 


N 

Frequency 

Cumt 

dative 

Per  Cent 

Males 

Females 

Both 

Males 

Fema 

es 

Both 

T^20 

0 

0 

0 

0 

0 

0 

2o<T^3o 
30<T^40 
40<T<50 
50<T<6o 

3 

6 

II 

I 

0 

8 

9 
4 

3 

14 
20 

S 

14 

43 

95 

100 

0 

38 

81 

100 

7 

40 

88 

100 

6o<T^7o 

0 

0 

0 

100 

100 

100 

in  item  selection.)  Due  to  the  relaxing  of 
the  F  restriction,  three  cases  with  ele- 
vated neurotic  triad  scores  were  included 
in  this  group  who  had  previously  been 
excluded  from  the  criterion  normal  sam- 
ple. The  other  39  test  cases  deviate  on 
non-triad  components  only. 

The  mean  raw  score  on  N  of  the  21 
male  test  cases  is  38.09,  SD  7.993;  for  the 
21  females  the  mean  raw  score  is  41.10 
and  the  SD  is  6.89.  These  means  corre- 
spond to  T  scores  of  41  and  44,  respec- 
tively. So  we  see  that  in  both  sexes,  the 
mean  T  score  of  the  test  cases  rises  about 
4  points  from  that  of  the  criterion  nor- 
mals on  whom  the  item  selection  was 
based.  The  median  T  score  is  41  for 
males,  and  44  for  females;  42.5  for  the 
sexes  pooled.  The  shift  seems  to  be  from 
the  region  below  T  =  40  into  the  region 
between  40  and  50.  The  proportion  of 
cases  which  do  not  exceed  T  =  50  is  88 


applicability  of  the  scale  to  cases  other 
than  the  triad  deviates,  were  it  not  for 
the  fact  that  the  raw  score  on  the  N  scale 
correlates  to  a  rather  marked  degree 
with  Pt  and  Sc  and  to  a  lesser  but  still 
positive  extent  with  some  other  scales  in 
the  normal  as  well  as  abnormal  popula- 
tion. For  this  reason  the  mere  presence 
of  deviation  on  these  other  components 
tends  to  be  associated  with  a  deviation 
on  N  (appearing  as  a  loio  T  score)  regard- 
less of  whether  the  person  measured  is  in 
the  hospital  or  out.  Therefore,  the  tend- 
ency of  the  present  group  to  score  low  on 
N  does  not  establish  its  utility  as  applied 
to  cases  other  than  those  elevated  on 
the  neurotic  triad,  until  we  investigate 
the  magnitude  of  a  similar  lowering  ef- 
fect on  the  cases  who  are  in  the  hospital. 
"When  this  is  done  results  are  not  nearly 
so  encouraging,  and  in  fact  indicate  that 
while  stable  statistical  differences  exist. 
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the  overlap  is  too  great  to  make  the  scale 
of  any  appreciable  practical  value  in  such 
cases. 

A  group  of  profiles  for  25  males  and  25 
females  was  extracted  from  the  file  of  ab- 
normals,  the  requirement  being  that  the 
profile  nuist  deviate  on  some  other  varia- 
ble besides  the  neurotic  triad,  that  the 
peak  scores  must  be  on  some  other  varia- 
ble, and  that  the  clinical  diagnosis  must 
not  be  hysteria,  hypochondriasis,  or  de- 
pression. Profiles  with  ?  or  F  scores  above 
70  were  also  excluded  as  probably  in- 
valid. These  cases  were  of  various  diag- 
nostic groups,  including  psychoneuroses 
other  than  those  mentioned  above, 
manic-depression,  schizophrenia,  psycho- 
pathic personality,  paranoid  condition, 
behavior  problem,  and  simple  adult  mal- 
adjustment. The  sex  difference  which 
appears  in  the  goodness  of  separation  is 
such  as  to  warrant  separate  treatment. 

For  the  25  male  abnormals  the  mean 
raw  score  was  29.92,  corresponding  to  a 
T  score  of  46,  and  a  median  raw  score 
of  31  (T  ==  48).  For  the  25  female  ab- 
normals the  mean  raw  score  was  29.92 
corresponding  to  a  T  score  of  56,  and  a 
median  raw  score  of  31  (T  =  55).  These 
comparisons  are  presented  in  Table  11. 

In  spite  of  the  significance  of  these  dif- 
ferences from  a  statistical  point  of  view, 
the  overlap  is  excessively  great  and  the 


differentiation,  while  indicating  some- 
thing of  theoretical  interest  regarding 
the  generality  of  at  least  part  of  the  N 
scale  as  a  suppressor,  is  not  sufficient  to 
be  of  practical  utility.  Inspection  of  the 
[)lotted  distributions  (not  presented  here) 
indicates  that  the  difference  of  the  sexes 
lies  in  the  much  greater  tendency  of  the 
male  abnormals  to  achieve  low  T  (high 
raw)  scores  than  is  the  case  with  the  fe- 
male abnormals.  As  regards  the  normals, 
in  both  sexes  the  "test"  normals  tend  to 
remain  below  the  population  average; 
and  this  tendency  appears  also  when  they 
are  compared  with  the  central  tendency 
of  the  50  abnormals.  In  the  latter  com- 
parison, we  find  only  14  per  cent  of  the 
female  normals  reach  or  exceed  the  me- 
dian T  score  of  the  abnormals;  and  19 
per  cent  of  the  male  normals  reach  or 
exceed  the  median  T  score  of  the  abnor- 
mals. But  at  the  other  end  of  the  distri- 
bution whereas  only  8  per  cent  of  the 
female  abnormals  fall  as  low  as  the  me- 
dian of  the  female  normals,  we  find  that 
fully  1/3  of  the  male  abnormals  fall  as 
low  as  the  median  of  the  male  normals. 
In  other  words,  if  one  has  a  T  score 
above  50  on  the  N  scale  with  any  nontriad 
component  deviating  to  a  T  of  70  or 
more,  he  is  likely  to  be  abnormal.  If  one 
has  a  T  above  60  on  N  in  such  a  case,  he 
is  practically  certain  to  be  abnormal.  But 


Table  ii 

Comparisons  of  the  42  "Test"  Normals  with  Non-Triad  Elevations  and  50 

Random  Abnormals  with  Non-Triad  Elevations 


Group 


Median 
Raw 
Score 


Mean 
Raw 

Score 


Mean  T 
Score 


Mean 
Raw 

Score 
Diff 


tdiff 


21  "test"  normals 
25  abnormals 
21  "test"  normals 
25  abnormals 


38 
31 

31 


38.10 
32  .60 
41 .  10 
29.92 


Males 
41 

46 

Females 

44 

56 


5-5° 


II. 18 


2.135 


1 10 


.o5>P>  .02 


P<  .01 
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a  loiu  T  score,  at  least  for  males,  by  no  those  items  which  must  be  operative  in 
means  guarantees  that  one  will  not  be  ab-  producing  such  discrimination  as  does 
normal,  whereas  it  does  seem  to  aflord  exist  to  show  up  as  clearly  powerful  dif- 
some  such  assurance  in  the  case  of  fe-  ferentiators  when  their  influence  is  sys- 
males.  tematically  masked  by  items  showing  a 
The  net  result  of  the  investigation  of  higli  correlation  with  the  scales  for  ab- 
this  test  group  can  be  summed  up  as  fol-  normal  components  which  are  being  con- 
lows:  The  mean  T-scores  on  N  of  "nor-  sidered.  That  even  with  the  existence  of 
mals"  showing  elevations  on  other  scales  such  high  correlations  with  Pt  and  Sc  as 
than  the  neurotic  triad  is  somewhat  will  later  be  indicated,  the  scale  is  able 
higher  than  that  of  the  criterion  group  to  produce  a  statistically  although  not 
of  normals.  The  appearance  of  T  scores  practically  significant  differentiation  sug- 
above  50  is  equally  rare  in  the  test  group,  gests  that  a  fiuther  search  for  a  highly 
as  is  also  the  case  for  scores  above  60.  The  generalized  suppression  variable  would 
incidence  of  scores  in  the  40  to  50  range  be  rewarded. 

increases  over  that  of  the  criterion  nor-  It  might  be  thought  that  the  exclu- 
mals.  The  differentiation  of  normals  sion  of  cases  showing  T  above  70  on  ?  or 
from  abnormals  is  not  nearly  so  good  as  F  from  the  "test"  normals  is  undesir.able 
this  finding  might  suggest,  because  ab-  since  such  profiles  do  occur  in  practice, 
normals  ^vith  deviations  and  diagnoses  The  present  position  is  that  for  this  pur- 
other  than  on  the  neurotic  triad  obtain  pose  (evaluating  differentiation)  every  ef- 
considerably  lower  T  scores  on  N  than  fort  should  be  made  to  include  only 
the  criterion  abnormals  and  are  in  the  valid  profiles,  and  that  the  clinical  sig- 
case  of  the  male  abnormals  actually  be-  nificance  of  apparently  invalid  ?  and  F 
loAv  the  mean  T  of  the  general  popula-  scores  may  be  subsequently  determined 
tion.  There  is  a  statistically  significant  once  the  power  of  the  scale  is  clearly 
difference  between  the  mean  T  scores  seen.  If  the  normal  records  called  invalid 
of  normals  and  abnormals  with  non-  because  of  their  vmusual  form  combined 
triad  elevations,  but  the  overlap  is  great,  with  suspiciously  high  ?  or  F  scales  were 
particularly  in  the  case  of  the  males.  It  included,  the  differentiation  woidd  not 
does  not  appear  that  the  amount  of  dis-  become  worse  but  on  the  contrary  would 
crimination  is  sufficient  to  have  any  hope  have  been  greatly  improved.  A  group 
that  the  N  scale  will  be  of  practical  util-  of  26  cases  originally  excluded  from  the 
ity  for  these  groups  in  its  present  form,  "test"  normals  because  of  invalid  look- 
The  statistical  stability  of  the  differentia-  ing  profiles  and  high  F  or  ?  scores  earns 
tion,  however,  suggests  that  further  re-  a  mean  T  score  of  37.31  (38.64  for  males, 
finement,  especially  in  the  removal  of  35-75  for  females)  which  is  considerably 
those  items  which  contribute  most  to  the  lower  than  the  value  for  these  means  ac- 
high  correlation  with  some  of  the  other  tually  obtained  by  the  more  rigorous 
scales,  might  improve  this  state  of  af-  selection.  Among  these  26  cases  no  pro- 
fairs.   It  is,  of  course,  very  difficult  for  file  shows  a  T  above  50  on  the  N  scale. 
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Chapter  V 

Miscellaneous  Properties  of  N  and  Its  Relation 
TO  Certain  Other  Variables 

N   considered  as  an   inverted   "lie"  eluding  psychoneuroses   (all    types),   de- 

SCALE  in  terms  OF  THE  SCORES  OF  prcssion,      psychopailiy,     schizophrenia, 

A  GROUP  OF  clinical  abnormals  mania.  The  mean  raw  score  on  N  for 

showing  normal  mmpi  these  35  patients  with  "normal"  appear- 

PROFILES  jj^g  profiles  was  26.63,  ^^^  the  SD  9.565. 

N  VIEW  of  the  slight  though  insignifi-  The  T  score  corresponding  to  this  mean 

cant  difference  between  the  N  scores  would  be  about  59.  Another  group  of  16 

of  abnormals  with  freely  varying  profiles  female  abnormals  (this  time  showing  no 

and    those    wath    profiles    "held    down"  scores  above  60)  taken  from  files  for  re- 

artificially    by    matching,    together   with  cent  patients  shows  a  mean  raw  score  of 

the  negative  correlations  later  to  be  re-  25.5,  corresponding  to  a  T  of  60.  For 

ported  between  the  two  chief  "lie"  scales  the  entire  group  of  51  female  abnormals 

and  raw  score  on  N,  it  seemed  possible  with  normal  profiles,  the  mean  raw  score 

that  giving  few  responses  scored  on  N  is  26.27,  corresponding  to  a  T  score  of 

might  be  allied   to   the  same   tendency  60,    and    the   SD    is   9.594.   This   mean 

which  gives  rise   to  elevated  lie  scores,  raw  score  differs  significantly  from  that 

Furthermoie,  there  are  actually  7  of  the  of   the    100   unselected   normal    females 

22  items  on  Lg  (an  unpublished  lie  scale  (t  =  5.515,  P  <  .01). 
in  process  of  development)  which  appear  A  similarly  chosen  group  of  26  male 

on  the  N  scale  also,  scored  in  the  oppo-  abnormals    showing    no    score    over    60 

site  direction  in  every  case.  One  partial  showed  residts  in  the  same  direction  but 

test  of  such  a  hypothesis  is  to  determine  of  lesser  magnitude.  The  mean  raw  score 

whether  the  T  scores  on  N  of  clinical  for  male  abnormals  with  normal  profiles 

abnormals    showing    "normal"    profiles  was  24.92,  corresponding  to  a  T  for  males 

tend  to  be  elevated.  of  only  54,  and  a  SD  of  7.4519.  This  raw 

The  file  of  profiles  for  hospitalized  fe-  score  mean  is  of  borderline  significance 
male  abnormals  was  entered  at  random  statistically,  the  t  in  comparing  it  with 
and  the  first  35  consecutive  cases  failing  the  mean  of  29.14  for  unselected  normal 
to  show^  any  MMPI  personality  compo-  males  being  2.079  with  124  d.f.,  so  that 
nent  over  65  were  extracted.  (Actually,  .02  <  P  <  .05.  Why  there  should  be  a 
the  first  19  cases  so  selected  showed  no  T  smaller  difference  in  the  case  of  males 
score  over  60  on  any  MMPI  scale;  but  it  is  not  clear,  nor  is  it  clear  why  the  comb- 
was  necessary  to  raise  the  boundary  to  65  ing  of  over  300  male  abnormal  profiles 
in  order  to  get  more  cases.)  All  cases  with  and  500  female  should  have  resulted  in 
?  over  58  were  eliminated  from  considera-  a  rather  pronounced  difference  in  the 
tion,  since  elevated  ?  scores  frequently  pattern  of  frequencies  for  clinical  diag- 
mean  spuriously  low  profiles  achieved  by  noses  among  those  managing  to  hold 
simply  failing  to  answer  large  numbers  their  profiles  down  to  "normal."  Over 
of  significant  items.  The  clinical  diag-  half  of  the  females  in  this  study  were 
noses  of  these  patients  were  variable,  in-  diagnosed  psychoneurosis,  3  were  schizo- 

41 


42 


PAUL    E,    MEEHL 


phrenics,  3  psychopaths,  2  each  of  simple 
adult  maladjustment,  involutional  de- 
pression, manic  depressive  depression, 
and  1  each  of  paranoid  state,  addiction, 
behavior  problem,  and  manic.  In  the 
case  of  a  samj)le  of  males  taken  in  the 
same  way  (consecutive  cases  in  the  file 
showing  "normal"  profdes)  there  were 
only  5  psychoncuroscs,  5  schizophrenias, 
7  psychopaths,  2  n\anic  depressed,  2  para- 
noid state,  2  homosexual,  2  alcoholic, 
and  1  behavior  problem.  Quite  possibly 
the  difference  in  the  relative  frequency 
of  cases  which  would  be  expected  to  show 
elevations  on  the  neurotic  triad  is  the 
major  cause  of  the  difference  fovmd  be- 
tween the  sexes  in  the  functioning  of  N 
as  a  sort  of  "lie"  scale. 

The  median  T  score  of  the  51  female 
abnormals  with  normal  profiles  was  60, 
and  only  14  per  cent  showed  T  scores 
below  50.  In  the  case  of  the  males  the 
median  was  53.5  and  27  per  cent  showed 
a  T  score  below  50. 

In  summary,  it  may  be  concluded  that 
whatever  the  mechanism  may  be,  the 
N  scale  functions  to  a  significant  extent 
inversely  as  a  variety  of  "lie"  scale  not 
in  the  sense  of  indicating  deliberate  and 
conscious  deception  but  some  tendency, 
however  produced,  to  avoid  putting  one- 
self in  an  unfavorable  or  abnormal  light 
when  answering  the  items  on  MMPI. 
Female  abnormals  who  fail  to  show  any 
of  their  personality  components  on  the 
eight  scales  deviating  by  more  than  1.5 
sigma  (T  =  65)  tend  on  the  average  to 
reach  a  T  of  60  on  the  N  scale.  Male  ab- 
normals with  similarly  "normal"  profiles 
otherwise  show  the  same  tendency  to  ele- 
vated T  scores  on  N  but  to  a  consider- 
ably weaker  degree,  averaging  a  score 
only  about  3/10  of  a  sigma  above  the 
mean  of  men  in  general  (T  =  54).  Only 
about  1  female  abnormal  in  6  or  7  can 


hold  her  profde  down  to  65  on  all  scales 
and  keep  her  N  score  below  T  =  50. 
About  I  male  abnormal  in  4  is  able  to 
achieve  this,  however. 

The  occinrence  of  T  scores  in  the  re- 
gion 55  to  65  is,  of  course,  so  frequent 
among  the  general  normal  population 
that  the  present  scale  could  not  have  any 
practical  utility  in  the  detection  of  lying 
when  insj)ecting  profiles  for  abnormality. 
It  is  only  after  being  possessed  of  the  in- 
formation that  the  patient  is  abnormal 
in  spite  of  his  apparently  normal  profile 
that  it  occurs  to  us  to  interpret  an  N  of, 
say,  55  in  such  a  way  as  to  "explain"  the 
low  profile.  The  chief  importance  of  the 
present  finding  regarding  the  function  of 
N  in  a  rough  way  comparable  to  L  and 
L,,  is  with  reference  to  its  ultimate  the- 
oretic interpretation. 

RESULTS   OF   APPLICATION   OF   N   TO 
UNSELECTED    ABNORMALS 

Upon  the  Rosanoff  hypothesis  which 
originally  suggested  the  present  inves- 
tigation, one  might  plausibly  argue  that 
if  the  N  scale  were  measuring  even  in  a 
crude  way  this  so-called  "normal"  con- 
trolling component  of  personality,  the 
mean  score  of  a  group  of  persons  defined 
simply  by  their  psychiatric  abnormality 
should  be  elevated  (i.e.,  raw  scores  on  N 
should  be  low).  The  elevation  of  T 
scores  on  N  might  not  be  expected  to  be 
as  great  for  unselected  abnormal  per- 
sons as  for  those  in  the  criterion 
abnormal  group  even  on  Rosanoff's 
hypothesis.  For  since,  as  was  pointed 
out  previously,  the  criterion  abnormals 
had  their  MMPI  profiles  "artificially" 
held  down  by  the  matching  process,  we 
might  in  effect  have  been  selecting  these 
abnormals  who  were  clinically  abnormal 
in  spite  of  fairly  small  amounts  of  devia- 
tion in  the  specific  abnormal  components 
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which  N  is  supposed  to  control.  Such  a 
group  would  be  expected  on  Rosanoff's 
hypothesis  to  be  characterized  by  an  ex- 
tremely small  amount  of  the  "normal" 
component,  since  such  relatively  slight 
deviations  on  the  MMPI  scales  (assuming 
these  scores  for  the  moment  to  be  valid) 
have  resulted  in  their  being  psychiatri- 
cally  incapacitated. 


male  abnormals  is  actually  higher  than 
the  corresponding  raw  score  mean  for 
the  male  normals,  but  this  difference  is 
not  significant  statistically.  The  vari- 
ances do  not  differ  significantly  by  the  F- 
test. 

Because  of  the  sex  difference  these  re- 
sults are  difficult  to  interpret  in  any  con- 
sistent way.  It  would  seem  that  the  fail- 


Tablf.  12 

Comparisons  of  Means  and  Variances  of  50  Random  Abnormals 
of  Both  Sexes  with  the  General  Population  Sample 


Group 


N 


Mean 
Raw 


SD 


Diff. 
of  Means 


C.R. 


Male  normals 

100 

29.14 

50 

9-55° 

Male  abnormals 

5° 

30.14 

49 

9-457 

Female  normals 

100 

35-59 

50 

9.804 

Female  abnormals 

50 

30.62 

55 

10. 284 

1 .00 


4-97 


.608 


2.834 


P=.23 


P<.oi 


Following  this  line  of  reasoning  we  se- 
lected 50  abnormals  from  both  sexes 
from  the  file  of  hospitalized  patient's  pro- 
files. Cases  with  organic  involvement  or 
with  diagnosis  unrecorded  or  doubtful 
were  excluded.  No  restrictions  were 
placed  on  ?,  L,  and  F  in  this  sample,  be- 
cause of  another  use  to  which  it  was  in- 
tended to  put  the  data.  The  diagnoses 
were,  of  course,  very  heterogeneous,  as 
was  intended,  the  largest  single  category 
being  psychoneurosis. 

The  means  and  the  variabilities  of 
these  abnormals  are  given  in  Table  12, 
together  with  a  test  of  the  significance 
of  their  deviations  from  the  unselected 
normal  sample  upon  whom  the  T  scores 
were  based. 

We  find  that  the  mean  raw  score  of 
the  female  abnormals  is  about  I/2  an 
SD  below  that  of  the  female  normals  on 
which  the  T  scores  are  based,  and  that 
this  difference  is  statistically  significant 
at  the  1  per  cent  level.  The  mean  of  the 


lue  to  obtain  any  difference  in  the  case 
of  the  males  constitutes  stronger  evidence 
against   any   interpretation   in   terms  of 
Rosanoff's    theory   of   control    than    the 
slight  although  stable  difference  in  the 
case  of  the  females  constitutes  evidence 
in  favor  of  such  an  interpretation.   In 
view  of  the  other  findings  on  the  average 
amount  of  deviation  found  in  abnormals 
in  MMPI  research,  a  deviation  of  .5  sig- 
ma  does  not  lend  much  support  to  the 
suggestion  that  these  female  abnormals 
are  clinically  abnormal  because  they  do 
not    have    enough    of    the    hypothetical 
"control"  or  normality  variable  to  stabi- 
lize themselves.  It  must  be  remembered 
that  this  T  score  of  55  is  still  almost  1/2 
SD   nearer   to   the   mean   of  unselected 
normals  than  tlie  mean  of  female  college 
students.  The  smallness  of  the  deviation 
and  the  lack  of  any  difference  in  the  case 
of  the  male  group  lead  us  to  doubt  seri- 
ously (especially  in  conjunction  with  the 
evidence  from  intercorrelations  and  item 
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content   to  be   discussed   later)   whether      group  (16-25)  ^^'t^ie  medical  students  and 
the  results  can  be  construed  as  support-      such  a  college  group,  as  pointed  out  pre- 


ing  the  original  h)pothesis.  That  there 
is  any  deviation  at  all  in  the  case  of  the 
one  sex  can  be  "explained"  on  simpler 
groimds  than  the  RosanoiT  hypothesis, 
as  will  be  seen  shortly. 

THE  RELATIONSHIP  OF  N  TO  AGE  AND  TO 

THE  OTHER  VARIABLES  OF 

THE  MM  PI 

Fmthcr  evidence  against  any  interpre- 
tation of  N  as  a  positive  "normality" 
component  of  personality  comes  from  a 
study  of  its  relation  to  the  other  MM  PI 
scales  and  to  chronological  age.  For  the 
100  ranflom  male  normals  on  whom  the 


viously,  tend  to  show  low  raw  scores  on 
N. 

The  intercorrelation  of  the  N  scale 
with  the  other  scales  of  MMPI  and  with 
a  few  luijjublishcd  or  abandoned  scales 
was  also  investigated.  In  I'able  13  below 
are  presented  the  correlation  coefficients 
for  raiu  scores  on  N  with  certain  other 
measures  on  MMPI,  for  the  sexes  sepa- 
rately and  for  normals  and  abnormals 
separately.  The  variable  C,,  is  the  no 
longer  employed  "correction  scale"  for 
the  old  hypochondriasis  key  (10),  and  L„ 
is  a  more  subtle  "lie"  scale  which  is  still 
in  the  process  of  validation. 


Table  13 
Correlation  of  Raw  Score  on  N  with  Other  Variables 


c,, 

u 

L 

F 

Hs 

D 

Hy 

Pd 

Pa 

Pt 

So 

Ma 

100  male  normals 

•79 

-.76 

-•17 

•43 

•S8 

.02 

-•25 

•33 

.27 

.84 

•71 

•37 

100  female  normals 

.78 

-•6s 

—  .  II 

.28 

•54 

.46 

.  II 

■33 

.  21 

.82 

.72 

•31 

50  male  abnormals 

.82 

-.82 

-.28 

.60 

■55 

•47 

.  21 

.46 

•58 

.80 

•74 

■36 

50  female  abnormals 

.78 

-.84 

—  .  21 

.42 

•37 

•56 

•17 

•37 

•53 

.80 

.69 

•31 

T  scores  were  standardized,  there  is  evi- 
dence of  a  progressive  rise  in  mean  N 
score  with  chronological  age  in  the  gen- 
eral population.  For  the  age  range  16-25 
the  mean  N  raw  score  was  23.73,  ^^  com- 
pared with  a  mean  of  34.30  for  the  age 
group  36-45  in  the  same  normal  sample. 
For  all  100  male  normals  there  is  a  cor- 
relation of  .38  betw-een  chronological  age 
and  raw  score,  which  is  significantly  dif- 
ferent from  zero  (P  <  .01).  For  the  100 
female  normals  this  correlation  is  only 
.16,  which  is  not  statistically  significant 
in  its  deviation  from  zero  (P  >  .10).  The 
means  for  females  also  do  not  show  as 
much  change,  being  33.67  for  the  young- 
est age  group  versus  38.80  for  the  oldest. 
The  difference  in  the  case  of  the  males 
is  in  part  due  to  the  fact  that  about  half 
of  the  normal  males  in  the  youngest  age 


For  the  100  normals,  all  correlations 
above  .20  are  significantly  greater  than 
zero  (P  <  .05);  for  the  50  abnormals, 
those  above  .27  are  so  significant  (7). 
The  correlations  with  F  and  L  probably 
underestimate  the  intensity  of  associa- 
tion since  the  distribution  of  these  two 
variables  is  so  markedly  skew  that  the 
conditions  for  the  Pearson  r  being  a 
maximally  descriptive  statistic  of  associa- 
tion are  assuredly  not  fulfilled.  The  im- 
portance of  an  exact  measure  of  covaria- 
tion was  not  such  as  to  warrant  the  ap- 
plication of  more  appropriate  descrip- 
tive statistics,  however. 

Of  particular  interest  in  the  table 
above  are  the  findings  which  bear  upon 
the  psychological  interpretation  of  the 
N  scale.  Although  detailed  discussion  of 
this  matter  will  be  given  in  the  following 
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chapter,  some  emphasis  may  be  placed  on 
certain  correlations  here.  Of  particular 
interest  are  the  following  relationships: 

1.  The  high  positive  correlation  of  N  with 
the  old  correction  scale  C,,. 

2.  The  marked  to  high  negative  correla- 
tions of  N  with  L„,  and  the  fact  that 

b 

these  correlations  are  higher  in  the  case 
of  the  abnormals.  Also  allied  to  this  the 
negative  correlations,  although  low, 
with  L. 

3.  The  high  positive  correlation  of  N  with 
Pt  and  Sc. 

4.  The  relatively  low  correlation  (signifi- 
cantly negative  for  normal  males)  with 
Hy,  despite  the  positive  association  with 
Hs. 

5.  The  positive  correlation  ^\ith  F. 

6.  The  general  tendency  to  significant  and 
in  some  cases  marked  positive  associ- 
ation with  the  various  abnormal  com- 
ponents, there  being  only  one  negative 
correlation  in  the  table  except  for  the 
lie  scales. 

The  fact  that  N  correlates  low  and  in 
one  group  significantly  negative  with  Hy 
and  yet  correlates  fairly  markedly  with 
Hs  suggested  another  correlational  study. 
The  Hy  and  Hs  scales  have  many 
"somatic"  complaint  items  in  common. 
The  Hy  scale  differs  not  only  in  that  it 
does  not  share  quite  all  of  the  Hs  somatic 
items  with  the  latter  (which  in  itself 
would  not  account  for  this  correlational 
difference),  but  more  importantly  the  Hy 
scale  contains  a  large  number  of  non- 
somatic  items,  including  those  "denial 
of  the  psychiatric"  responses  that  were 
mentioned  in  Chapter  II.  This  group  of 
items,  as  was  pointed  out  there,  have  the 
common  property  of  denying  psychiatric 
involvement  and  assert  oneself  to  be  ex- 
traordinarily well  adjusted  socially. 
These  items  are  responded  to  by  the 
hysteric  in  the  statistically  "normal" 
direction,  and  hence  are  recorded  "O"  in 
scoring  the  MMPI.   Examples  of  these 


items  and  the  responses  scored  for  Hy 
are: 

C-25  I  ha\  c  often  lost  out  on  things  because 
I  couldn't  make  up  my  mind  soon 
enough.     (F) 

D-52  I  think  most  people  would  lie  to  get 
ahead.     (F) 

E-43  When  in  a  group  of  people  I  have 
trouble  thinking  of  the  right  things 
to  talk  about.     (F) 

E-54  I  can  be  friendly  with  people  who  do 
things  which  I  consider  wrong.     (T) 

F-25  I  resent  having  anyone  take  me  in  so 
cleverly  that  I  have  to  admit  that  it 
was  one  on  me.     (F) 

G-40  In  walking  I  am  very  careful  to  step 
over  sidewalk  cracks.     (F) 

H-10  I  commonly  wonder  what  hidden  rea- 
son another  person  may  have  for 
doing  something  nice  for  me.     (F) 

Because  of  the  relations  in  the  correla- 
tion table,  plus  the  growing  hypothesis 
(after  abandoning  Rosanoff's)  that  the  N 
scale  was  at  least  in  part  detecting  a  ten- 
dency to  say  psychiatrically  imdesirable 
things  about  oneself  even  if  unwarranted, 
it  was  felt  that  there  might  be  a  correla- 
tion  between  N  and  these  "I-am-psychi- 
atrically-fine"  responses  found  in  hy- 
steria, and  that  this  correlation  should 
be  negative.  Accordingly,  the  correlation 
was  computed  for  a  sample  of  60  normal 
females  and  60  normal  males,  disregard- 
ing their  profiles.  However,  there  are  5 
items  out  of  the  18  which  are  common 
to  Hy  and  N  (all  five  incidentally  scored 
oppositely  on  the  two  scales).  To  elimi- 
nate the  effect  of  this  overlap  of  items 
another  correlation  coefficient  was  deter- 
mined, this  time  based  on  N  versus  the 
13  remaining  items  of  Hy  after  the  five 
overlapping  were  removed.  The  results 
of  these  calculations  are  in  Table  14. 

These  four  correlations  all  differ  very 
significantly  from  zero  (P  <  .01)  al- 
though they  do  not  differ  significantly 
from  one  another  in  passing  from  the 
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Table  14 

Correlation  of  Raw  Score  on   N  with  the  "O" 

Items  of  the  Hy  Scale,  for  60  Normals  of 

Each  Sex,  Both  with  and  without 

the  5  Items  Common  to  Both 


Group 


N  Versus  the 
Entire  iS 
"O"  Items      lappini^  Items 


N  \'ersus  the 
i^  Non-Over- 


60  normal  males 
60  normal  females 


•72 
.69 


-.68 
-•59 


first  to  the  second  columns  (P  >  .30).  We 
observe  that  there  is  a  marked  inverse 
association  between  raw  scores  on  N  and 
raw  scores  on  the  non-somatic  psychic- 
denial  items  of  the  Hy  scale,  and  that 
this  relation  is  only  to  a  small  or  insignifi- 
cant extent  a  function  of  the  actual  item 
overlap  between  the  two  scales  being 
correlated.  This  fact  may  be  said  to  sub- 
stantiate the  prediction  which  prompted 
the  investigation  of  the  relationship,  and 
to  some  degree  to  support  the  hypothesis 
regarding  the  natiue  of  N. 

THE   ITEM    OVERLAP   BETWEEN    N   AND 
OTHER    MM  PI    SCALES 

As  is  usual  in  MMPI  scales,  there  is 
considerable  overlap  of  items  derived  on 
N  and  on  other  scales.  Of  the  78  items 
scored  on  N,  42  also  appear  on  at  least 
one  other  scale  of  the  eight  personality 
components,  whereas  the  remaining  36 
do  not.  Some  of  these  remaining  36,  how- 
ever, appear  on  the  lie  scales  or  on  the 
old  Ch  scale.  In  the  case  of  overlapping 
items  the  mere  number  of  common  items 
is  slightly  misleading  inasmuch  as  often 
the  items  are  not  scored  in  the  same 
direction  on  both  scales.  Table  15  shows 
the  number  of  items  common  to  N  and 
each  of  the  other  scales,  and  indicates 
the  frequency  of  items  in  which  the 
direction  scored  on  N  is  the  same  or  re- 
versed. 

For    most    of    the    personality    scales 


proper  the  overlap  is  quite  small.  The 
largest  amount  of  item  community  for 
any  of  the  personality  components 
proper  is  only  7  per  cent.  In  many  cases 
it  will  be  observed  that  the  distribution 
of  responses  scored  same  and  opposite  is 
nearly  equal.  The  most  marked  devia- 
tions from  this  trend  arc  on  Pt,  C,„  L,.,  G, 
and  "  +  ."  In  these  five  cases,  every  item 
on  N  is  scored  either  consistently  the 
same  or  consistently  the  opposite  as  it  is 
on  the  other  scale  compared  with  N.  In 
the  case  of  the  first  three  scales  in  ques- 
tion, the  direction  of  scoring  is  in  line 

Table  15 

Item  Overlap  between  N  Scale  and 

Other  MMPI  Scales 

(Number  of  N  Items  =  78) 


Num- 

Num- 

Num- 

Com- 

ber 

ber 

Scale 

ber  of 

mon 

Scored 

Scored 

Items 

Items 

the 
Same 

Oppo- 
site 

L 

15 

I 

0 

I 

F 

64 

6 

2 

4 

Hs 

33 

3 

2 

I 

D 

60 

9 

4 

5 

Hy 

60 

9 

3 

6 

Pd 

5° 

6 

5 

I 

Pa 

40 

2 

I 

I 

Pt 

48 

7 

7 

0 

Sc 

78 

4 

2 

2 

Ma 

46 

4 

0 

4 

Mf 

60 

6 

3 

3 

Ch 

48 

13 

13 

0 

Cd 

II 

2 

I 

I 

u 

22 

7 

0 

7 

G 

62 

16 

16 

0 

4- 

S6 

II 

II 

0 

with  the  correlations  found  for  the  scales 
as  a  whole.  Pt  and  C,!  are  positively  as- 
sociated with  raw  scores  on  N,  whereas 
Lg  is  negatively  associated.  The  signifi- 
cance of  the  item  overlap  with  the  un- 
published scales  G  and  "  +  "  is  not  clear 
because  the  psychological  (and  even  clini- 
cal) nature  of  these  scales  is  unknown, 
they  having  been  derived  by  purely  sta- 
tistical methods  analogous  to  factor  an- 
alysis without  regard  for  any  criterion 


-1U 

i 
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or  clinical  significance.  So  far  as  any  evi- 
dence exists  regarding  G  and  "  +  "  they 
represent  some  sort  of  a  general  factor 
of  abnormal  answering  (whether  due 
to  test-attitudes  or  personality  factors 
proper  is  not  known)  which  permeates 
the  MMPI  items  rather  generally. 

AN    EXAMINATION    OF   THE   TEN    MOST 
POTENT   ITEMS  ON   N 

There  is  not  any  very  obvious  psycho- 
logical interpretation  of  the  whole  group 
of  N  items  which  can  be  easily  made 
from  an  inspection  of  their  actual  verbal 
content.  As  has  been  pointed  out  previ- 
ously, they  do  have  the  common  property 
(aside  from  the  differentiating  one  which 
was  used  to  identify  them)  of  admitting 
to  personal  traits,  attitudes,  and  experi- 
ences which  are  not  admitted  by  the  ma- 
jority of  the  general  population  and  have 
the  character  in  most  cases  of  being  "un- 
desirable" or  "maladjusted"  responses.  It 
is  of  interest  to  examine  the  ten  most 
powerful  differentiators  as  judged  by 
their  proportional  differences  in  the 
criterion  groups.  These  ten  most  dis- 
criminating items  are  as  follows: 

ten  most  powerfully  discriminating 

items  of  n,  with  the  response 

scored  for  the  more 

"normal"  indicated 


C-35 
E-49 

E-52 
F-7 

F-47 


G-28 

H-35 
H-36 
H-41 
H-52 


I  liked  school.     (F) 

It  does  not  bother  me  that  I  am  not 

better  looking.     (F) 
People  often  disappoint  me.     (T) 
What   others    think   of   me    does    not 

bother  me.     (F) 
Often,  thovigh  everything  is  going  fine 
for  me,  I  feel  as  though  I  don't  care 
about  anything.     (T) 
I  am  not  easily  angered.     (F) 
A  windstorm  terrifies  me.     (T) 
I  am  not  afraid  of  fire.     (F) 
I  have  no  fear  of  water.     (F) 
I  have  no  dread  of  going  into  a  room 


by  myself  where  other  people  have 
gathered  and  are  talking.     (F) 

E-52,  F-47,  G-28,  H-35,  H-36,  H-41  do 
not   appear   on   any   of   the   personality 
scales   proper  of  MMPI.  Two  of  them 
appear   on   C,,   and   one   appears  on   L^ 
(scored  reversed).  All  ten  of  these  most 
potent   items   are   responded    to    in    the 
statistically  "unustial"  fashion  more  often 
by  the  criterion  normals  than  by  the  ab- 
normals;  and  as  is  apparent  from  reading 
them,  they  are  all  responded  to  by  the 
normals  in  the  direction  which  would  be 
considered     superficially     indicative     of 
mental    ill    health    and    maladjustment. 
The  responses  to  H-35,  H-36,  and  H-41 
are    certainly    very    mysterious    as    indi- 
cators of  "normality,"  as  are  the  others 
also  to  a  variable  extent.  The  writer  is 
not  prepared  to  defend  any  hypothesis 
as  to  the  common  property  of  these  items 
psychodynamically,    nor    even    to   assert 
that  they  have  a  common  property.  There 
may  be  several  "suppression"  variables 
operating  via  these  different  items  such 
that  it  would  be  a  mistake  to  attempt  an 
interpretation  to  hold  for  all  of  them. 
This  question  will  be  discussed  further 
in  Chapter  VI. 

item    OVERLAP    OF    N    WITH    THE    "NOR- 
MAL"    COMPONENT    OF    THE    HUMM- 
WADSWORTH  TEMPERAMENT  SCALE 

Since  the  "normal"  component  of  the 
Humm-Wadsworth  Scale  was  designed 
by  the  authors  to  measure  the  hypotheti- 
cal normality  or  control  factor  of  Rosan- 
off  which  gave  rise  to  the  present  investi- 
gation, it  is  of  interest  to  inquire  what 
relation,  if  any,  holds  between  the 
Humm-Wadsworth  normal  scale  and  the 
present.  There  are  no  correlational  data 
available  at  the  present  writing,  but  some 
idea  may  perhaps  be  gained  by  a  study 
of  item  overlap  between  N  and  the  nor- 
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mal  component  of  the  Hiunni-Wads- 
worth.  Of  the  78  items  on  N  and  the  38 
on  the  normal  conijDonent  of  the  Hmnm- 
^Vadsworth,  we  find  that  there  are  seven 
items  in  common  with  ahnost  identical 
wording.  The  items  in  question  are  indi- 
cated with  their  item  numbers  in  the  two 
tests: 

N-scale  Normal  scale  of 

from    MAI  PI  Humm-Wadsxvorth 

C-25  73 

D-13  109 

D-52  94 

F-13  282 

G-27  28 


I-'5 
1-30 


97 
279 


Although  the  nimiber  of  items  is  too 
small  to  draw  any  definite  conclusions, 
it  is  Avorth  mentioning  that  in  every  one 
of  these  seven  items,  the  response  scored 
for  "normal"  on  the  present  N-scale  is 
opposite  to  the  direction  scored  for  "N" 
on  the  Humm- Wads  worth.  It  seems  un- 
likely from  this  finding  alone  that  the 
two  are  closely  related,  and  it  may  be 
that  their  correlation  is  negative  as  would 
be  suggested  by  this  finding. 

A    MINOR    EXPERIMENT    IN    "bLIND" 
DIAGNOSIS    UTILIZING    N 

The  distributions  of  criterion  and  test 
cases  and  the  tests  of  significance  indicate 
whatever  amount  of  validity  the  N  scale 
possesses  for  the  discrimination  of  devi- 
ant-scoring normals  from  deviant-scoring 
clinical  abnormals.  However,  a  minor 
experiment  was  done  involving  an  at- 
tempt to  discriminate  a  group  of  profiles 
obtained  from  supposedly  "normal"  per- 
sons from  a  group  obtained  from  hos- 
pitalized abnormals,  using  the  N-scale  as 
a  basis.  Three  separations  were  of  in- 
terest: 

1,  The    separation    obtained    when     the 


judge  merely  looked  at  the  profiles  and 
was  asked  to  pick  out  the  actual  abnor- 
mals, in  the  absence  of  any  N  score  on 
the  profile. 

2.  The  separation  obtained  when  the 
judge  was  in  situation  (1)  except  that 
the  N-score  had  meanwhile  been  added. 

3.  The  separation  obtained  when  the  pro- 
files were  mechanically  sorted  in  terms 
of  the  magnitude  oi  the  N-score  only, 
with  no  "judgment"  based  upon  profile 
form  and  amount  ol  elevation  being 
permitted. 

In  order  to  reduce  the  difficulty  and 
the  fatigue  of  making  the  separation,  the 
number  of  cases  employed  was  not  great; 
and  this  fact  increases  the  difficulty  of  at- 
taining statistical  significance,  by  some 
tests  at  least.  There  were  chosen  at  ran- 
dom (every  third  case,  oinitting  college 
cases)  from  the  entire  group  of  deviant 
scoring  normals  (persons  with  any  T  ^ 
70  but  out  of  the  hospital)  22  cases  for 
"test  normals."  These  were  all  males  be- 
cause of  the  lack  of  sufficient  test  cases 
among  the  females,  which  would  have 
resulted  in  too  large  a  share  of  the  nor- 
mals being  members  of  the  criterion 
group,  thtis  unduly  biasing  results  in 
favor  of  the  scale.  The  profiles  for  these 
22  male  "normals"  showing  deviant 
scores  on  at  least  one  MM  PI  scale  were 
then  drawn  by  a  third  party. 

This  third  party  also  recorded  profiles 
for  22  hospitalized  abnormals  taken  at 
random  from  the  hospital  files,  excluding 
cases  wdth  almost  certain  invalidity 
judged  by  elevated  ?  and  F.  Since  the 
writer,  who  was  to  be  one  of  the  judges, 
kneio  a  priori  that  all  of  the  "normals" 
would  have  at  least  one  score  T  ^  70 
(having  been  selected  for  that  reason), 
it  was  necessary  to  include  among  the 
abnormals  only  those  having  abnormal 
profiles.  The  result  is  naturally  to  make 
the  problem  more  difficult  for   the  N- 
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scale,  since  from  the  previous  findings 
we  know  that  abnormals  with  non-devi- 
ant profiles  tend  to  score  high  T's  on  the 
N-scale.  All  of  the  present  44  profiles,  in 
other  words,  were  "deviant"  in  that  at 
least  one  of  the  eight  personality  scales 
showed  T  ^  70. 

1  hcsc  44  profiles  were  randomized  by 
the  third  party  to  the  experiment  and 
presented  simply  as  profiles  with  T-scores 
indicated  at  the  bottom,  but  without  any 
score  on  N  having  yet  been  recorded  on 
them.  Two  judges  were  tried  in  the  sepa- 
ration, and  the  separation  of  the  first 
judge  was,  of  cotirse,  not  known  to  the 
second.  One  of  the  judges  was  Professor 
S.  R.  Hathaway,  co-author  of  the  MM  PI; 
the  other  was  the  present  writer.  The 
information  accessible  to  the  judge,  be- 
sides the  profile,  was  the  statement  that 
of  the  44  cases,  22  were  in  fact  the  records 
of  persons  clinically  abnormal  and  hos- 
pitalized; whereas  the  remaining  22  were 
profiles  of  persons  who,  so  far  as  known, 
were  psychiatrically  "normal"  and  were 
not  under  a  doctor's  care  at  the  time  of 
taking  the  test  but  were  out  in  the  com- 
munity. The  judge  was  instructed  that 
he  must  di\ide  the  44  into  22  he  guessed 
"normal"  and  another  22  he  guessed  "ab- 
normal." This  restriction  as  to  how  many 
could  be  put  in  each  category  was  im- 
posed on  Professor  Hathaway  because 
the  writer,  of  course,  knew  the  group  to 
be  divided  evenly  while  he  was  making 
his  judgments. 

There  was  a  difficidty  arising  from  the 
entire  design  which  showed  up  in  mak- 
ing the  statistical  analysis,  namely,  that 
the  judges  were  sufficiently  "good"  at  dis- 
criminating the  normals  from  the  ab- 
normals regardless  of  N  that  in  order  for 
a  significant  difference  in  proportion  cor- 
rect to  be  algebraically  possible  with  44 
cases  the  discrimination  using  N  would 


have  to  be  practically  perfect. 

After  the  classification  had  been  made 
by  Professor  Hathaway  using  the  profile 
without  N,  the  writer  made  his  classifi- 
cation without  N.  Then  the  N  scale  was 
added  by  the  third  party,  and  the  writer 
attempted  another  separation.  The 
second  separation  by  the  writer  was  done 
several  days  after  the  first,  biu  it  is 
possible  that  the  earlier  sorting  (although 
its  correctness  was  not  checked)  exerted 
its  influence  on  the  latter.  Nevertheless, 
the  writer  was  found  in  subsequent  an- 
alysis to  have  made  a  total  of  18  changes 
(nine  exchanges)  in  the  second  sorting. 

For  piuposes  of  statistical  analysis  of 
the  results  of  these  three  separations  (two 
by  the  same  judge  but  differing  in  that 
the  N  scale  was  on  the  profile  in  making 
the  second  separation)  the  null  hypo- 
thesis was  assumed  to  hold  for  a  four- 
fold table.  The  goodness  of  separation 
was  tested  by  the  deviation  of  values  in 
the  four-fold  table  from  this  null  hypo- 
thesis, using  chi-square  as  the  test  of  sig- 
nificance. On  the  hypothesis  that  the 
judges  could  not  actually  discriminate, 
the  four  entries  in  the  table  should  be 
equal  within  errors  of  sampling.  A 
fourth  separation  was  carried  through  in 
which  the  entire  set  of  44  profiles  was 
simply  arranged  in  order  of  magnitude 
of  the  T-score  on  N,  a  median  drawn 
between  the  22nd  and  23rd  case  so  or- 
dered, and  the  top  half  of  the  cases  me- 
chanically labeled  as  "abnormal"  with 
the  bottom  half  labeled  "normal."  The 
division  occurred  at  between  T  =  45  and 
T  =  46,  about  one-half  sigma  below  the 
general  population  mean.  This  latter 
fact  probably  reflects  in  part  the  arbi- 
trary exclusion  of  abnormals  with  non- 
deviant  profiles,  who  occur  in  some  con- 
siderable number  in  practice  and  would 
almost  certainly  have  raised  the  median 
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score  on  N  had  they  been  included.  Suf- 
fice it  to  say  that  this  fourth  separation 
was  used  to  compare  the  rcsidts  of  a 
purely  statistical  and  mechanical  use  of 
N  with  the  judgment  of  persons  more  or 


cally  significantly,  at  least  with  this 
number  of  cases  involved)  by  adding  N 
to  the  profile.  The  most  important  find- 
ing here  is  probably  that  the  N  scale 
used    in   a   mechanical   fashion   does   al- 


Tablu  1 6 
Results  in  the  Four  Separations  in  the  "Blind"  Diagnosis  Experiment 


Separatiiin 


Per  Cent 
Correct 


Chi- 
Squarc 


Sipnificanre 
of  Deviation 
from  Chance 


1.  Hatha\va\,  witliout  N-score 

2.  Meehl,  witlunit  N-score 

3.  N-scalc,  working  mcchanicalK- 

4.  Meehl,  judging  with  N-score  avail- 

able 


59 

1-455 

P>.i6 

Not  significant 

68 

5.818 

P=.025 

Borderline 

73 

9.091 

P<  .003 

Significant 

77 

13.091 

P<  .001 

Significant 

less  "expert"  with  regard  to  the  clinical 
interpretation  of  MMPI  profiles. 

The  restdts  of  these  four  separations 
are  given  in  Table  16  below,  in  terms  of 
the  value  of  chi-square  from  the  four- 
fold table  as  w^ell  as  simply  the  per  cent 
of  "correct"  judgments  out  of  44. 

We  see  that  the  first  judge  does  not 
succeed  in  discriminating  the  normal 
from  the  abnormal  profiles  significantly 
better  than  chance  would  allow.  The 
second  judge  does  somewhat  better, 
achieving  a  deviation  from  chance  that 
has  borderline  statistical  significance  be- 
tween the  5  per  cent  and  the  1  per  cent 
level.  The  separation  of  the  second  judge 
is  improved  somewhat  (but  not  statisti- 


most  as  well  as  the  better  judge  when  he 
makes  use  of  it,  and  does  better  than 
either  judge  working  in  its  absence.  It 
should  be  pointed  out  that  the  second 
judge  may  have  been  aided  somewhat 
unconsciously,  both  in  his  matching  with 
and  without  the  N-scale,  by  having  seen 
a  few  of  the  profiles  several  months  pre- 
viously in  another  context,  although  it  is 
doubtful  whether  this  could  have  exerted 
any  marked  effect.  In  any  case,  the  results 
using  N  alone  are  of  the  greatest  signifi- 
cance in  the  above  table.  Even  including 
cases  with  non-triad  elevations  and  arti- 
ficially holding  all  abnormal  profiles  up, 
it  appears  that  N  has  sufficient  differenti- 
ating power  to  be  of  practical  value. 
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Chapter  VI 

PSYCHOI,OGICAL    INTERPRETATIONS    OF    N 

THE    PREVIOUS    treatment    of    the    N-  upon  study  of  the  items.  It  needs  also  to 
scale  in  this  paper  has  been  confined  l)c  pointed  out  that  all  of  the  78  items 
almost  entirely  to  a  consideration  of  its  probably  do  not  "function"  in  the  same 
statistical  properties.  Dynamic  functions  way  to  perform  the  empirical  discrimina- 
have  been  loosely  attributed  to  the  score  tions   found,   so   that   further  statistical 
in  remarks  such  as  "An  abnormal  female  and  experimental  study  of  N  needs  to  be 
with  all  personality  components  under  done    in   order    to    isolate,    at   least   for 
T  =  65  cannot  easily  hold  her  T-score  theoretical   reasons,    the   components   of 
on  N  under  50"  and  similar  comments;  which  it  is  very  likely  a  composite, 
but  no  serious  effort  has  been  made  to  What  we  know  of  N  experimentally  is 
give  a  theoretical  and  psychodynamic  ex-  that  it  seems  to  enable  us  to  distinguish 
position  of  what  N  is  behaviorally  and  persons  who  show  deviant  MMPI   pro- 
how    it    produces    the    discriminations  files  but  remain  clinically  "normal"  from 
which  it  does.  persons  with  equally  deviant  profiles  who 
The  present  chapter  is  only  a  tentative  do  not.  This  tendency  is  well  established 
effort  in  that  direction.  It  is  characteristic  in  the  case  of  the  scores  on  the  neurotic 
of  research  on  the  MMPI  that  very  little  triad,  but  it  also  holds  up  statistically 
attention  is  paid  to  the  actual  content  of  although  not  usefully  for  the  other  non- 
items,    nor   is   it    visually   asked    "How"  triad  components.  There  is  only  a  slight 
patients  of  a  given  description  come  to  tendency  (none  for  one  sex)  for  it  to  dis- 
respond  in  the  way  they  empirically  are  criminate  between  normals  generally  and 
found  to  do.  In  some  cases,  as  in  many  abnormals,  when  the  rest  of  the  MMPI 
of  the  items  of  the  D-scale  and  all  of  the  profile  does  not  enter  into  consideration, 
items  on  the  Hs  scale,  the  relation  of  It  seems  appropriate  at  this  time  to 
the  scored  response  to  the  clinical  entity  summarize  the  reasons  which  have  ac- 
discriminated  is  "obvious,"  i.  e.,  one  can  cumulated     throtighout    the    preceding 
see  (or  so  he  thinks)  luhy  a  depressed  or  pages  for  not  considering  N,  at  least  the 
hypochondriacal  person  would  come  to  major  part  of  it,  as  measuring  the  "nor- 
respond  in  the  way  indicated  by  the  scor-  mality"  factor  hypothesized  by  Rosanoff 
ing  key.  In  many  if  not  most  cases,  how-  and  measured  (purportedly)   by  the  N- 
ever,   this  relation  is  either  tenuous  or  component  of  Humm  and  Wadsworth. 
altogether  lacking.  In  the  case  of  N  we  I  submit  that  the  following  findings  ren- 
are  in  the  unfortunate  position  of  having  der  an  interpretation  of  N  in  these  terms 
to  reason  backwards  from  the  properties  extremely  unplausible: 
of  the  scale  and,  to  a  lesser  extent,  from  1.  The  responses  scored  for  normality 
the  nature  of  the  items,  to  a  guess  as  to  on  the  present  N-scale  are  in  the  vast 
its    psychological    "nature."    From    the  majority  of  instances  the  statistically  ab- 
findings  in  regard  to  other  scales,  it  must  normal  responses  from  the  standpoint  of 
be  obvious  that  interpretations  must  of  what  most  persons  say  to  these  items, 
necessity  be  held  with  only  very  slight  2.  The  responses  scored  for  normality 
confidence  in  so   far  as  they  are  based  are    with    practically   no   exceptions   re- 
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sponses  which  admit  to  personal   traits  positive  "normal"  component  of  the  kind 

and  attitudes  of  the  sort  that  are  psy-  in  question. 

chiatrically    undesirable    and    indicative  6.  The  raw  scores  on  N  correlate  high 

prima  facie  of  maladjustment  and   un-  negative  with  a  subtle  lie  scale,  and  to  a 

happiness,  assuming  them  to  be  an  ade-  lesser  but  still  negative  extent  with  the 

quate  description  of  the  person.  old   lie  scale.   While   this   fact  does  not 

3.  The  raw  score  on  N,  which  tends  contradict  the  Rosanoff  interpretation 
to  be  high  among  "normals"  who  have  of  N,  still  it  remains  quite  unexplained 
deviant  profiles  but  low  among  abnor-  on  that  basis. 

mals,   shows   a  marked   and  statistically  7.  In   the  case  of  the  non-triad  com- 

significant  deviation  in  the  low  direction  poncnts    of    abnormality,    it    has    been 

in    unselected    college    students    and    in  pointed  out  that  if  a  person  shows  devi- 

adults  from  the  normal  population  who  ations  on  these  components  and  has  a 

have    previously    attended    college.    To  loio  raw  score  on  N,  he  is  almost  certain 

reconcile  this  finding  with  the  interpre-  to  be  psychiatrically  incapacitated  to  the 

tation  in  question  it  would  be  necessary  extent  of  being  in  a  hospital.  Whereas  if 

to  assert  that  college  students  and  col-  he  has  high  raw  scores  on  N,  he  is  by  no 

lege-educated  persons  have  for  some  un-  means  guaranteed  against  such  a  condi- 

known  reason  considerably  less  normal,  tion.  The  hypothesis  in  question  posits 

inhibiting,  and  controlling  components  a  relation  working  both  ways — too  little 

in    their   personalities    than    persons    in  "control"  will  resvdt  in  break-down  in 

general,  and  in  fact  that  such  college  per-  the  presence  of  slightly  elevated  abnor- 

sons  have  even  less  of  this  normalizing  mal  tendencies;  but  conversely,  an  excess 

factor  than  an  unselected  sample  of  psy-  of  control  is  supposed  to  prevent  the  ap- 

chotic  and  neurotic  persons.  pearance  of  abnormality  even  when  such 

4.  The  N-scale  differentiates  only  to  a  tendencies  are  present. 
sli2:ht  extent  between  female  abnormals  8.  There  are  seven  items  on  the  • 
and  females  from  the  general  normal  present  scale  which  occur  also,  with 
population,  and  it  does  not  differentiate  slight  modifications  of  wording  in  some 
male  abnormals  at  all.  Regardless  of  the  cases,  on  the  Humm-Wadsworth  "nor- 
magnitude  of  the  other  specific  compon-  mal"  scale.  All  seven  of  these  items  are 
ents,  on  the  Rosanoff  hypothesis  it  would  scored  in  the  opposite  direction  for  "nor- 
be  expected  that  at  least  on  the  average,  mal"  on  the  present  scale  from  the  way 
persons  who  have  broken  down  psy-  they  are  scored  on  the  Humm-Wads- 
chiatrically  would  have  less  of  the  "nor-  worth.  To  the  extent  that  the  latter  scale 
mal"  component  than  an  unselected  discriminates  between  normals  and  ab- 
group  of  normals.  normals  generally  and  functions  in  the 

5.  The  raw  scores  on  N  correlate  to  a  way  these  authors  describe  it  to,  this  fact 
significant  extent  with  the  abnormal  argues  against  making  an  interpretation  ^ 
components  of  MMPI,  both  among  nor-  in  terms  of  Rosanoff's  theory  for  the  .i 
mals  and  among  the  abnormals.  In  every  present  case.  jj 
case  but  one  these  correlations  are  posi-  For  these  reasons  the  interpretation  % 
tive,  and  in  some  cases  (e.  g.,  Pt)  they  are  of  N  in  terms  of  a  positive,  dynamic  per- 
very  high.  There  is  no  rational  way  to  sonality  component  of  "normality"  or 
interpret  such  a  finding  in  terms  of  a  "control"    which    inhibits    or    regulates                       f 
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those  components  of  temperament  which 
might  otherwise  result  in  psychiatric  up- 
set must  be  discarded.  This  is  not  to  be 
construed  as  meaning  that  Rosanoff's 
hypothesis  is  false,  nor  that  the  Humm- 
Wadsworth  normal  component  does  not 
measure  Rosanoff's  variable,  but  merely 
that  the  present  N-scale  cannot  be  inter- 
preted psychologically  in  that  w'ay.  It 
would  seem  that  if  such  a  factor  as 
Rosanoff's  does  exist,  somewhere  among 
the  495  items  studied  one  should  find 
items  loaded  with  it.  And  this  latter  may 
actually  be  the  case,  since  the  reasons  ad- 
duced against  such  an  interpretation  of 
N  apply  to  the  scale  "as  a  whole,"  i.  e.,  as 
manifested  in  the  correlational  and  dis- 
criminating properties  of  the  entire  set 
of  78  items,  which  are  probably  not,  as 
has  been  pointed  out  previously,  psycho- 
logically homogeneous.  It  is  conceivable 
that  the  minority  of  items  which  are 
"O"  responses  on  the  N  scale  are  more 
akin  to  the  kind  of  item  that  would 
sample  Rosanoff's  hypothetical  normal- 
ity factor.  On  the  other  hand,  it  may  be 
that  these  items  would  turn  out  to  be  of 
the  same  nature  as  some  of  those  on  the 
Humm-Wadsworth — merely  denials  of 
symptoms  of  such  severity  or  diagnostic 
portent  as  would  hardly  occur  in  normal 
persons.  Thus,  the  first  two  items  of  N 
are  what  one  would  a  priori  expect  to 
find  on  a  "normal"  scale: 

"I  am  troubled  by  attacks  of  nausea  and 
vomiting."     (F) 

"1  have  had  attacks  in  which  I  could  not 
control  my  movements  or  speech  but  knew 
what  was  going  on  around  me."     (F) 

If  all  of  the  items  on  N  were  of  this  sort, 
the  scale  would  be  trivial  and  thoroughly 
uninteresting  theoretically,  but  it  would 
not  be  difficult  of  interpretation.  If  such 
items  preponderated,  one  would  simply 
have  a  case  of  listing  those  self-reported 


symptoms  wdiich  would  practically  never 
occur  in  normal  persons  but  would  fre- 
cjuently  occur  in  psychoneurotics  of  the 
sort  who  score  high  on  the  neurotic  triad. 
To  be  "normal"  would  raise  your  score 
in  answering  such  items  in  the  indicated 
direction  simply  because  to  be  "normal" 
means  being  relatively  free  of  such  symp- 
toms. From  the  evidence  that  has  been 
adduced  up  to  now,  it  is  clear  that  the 
major  part  of  N  cannot  be  interpreted 
in  that  light  either. 

Another  possible  interpretation  of  N 
is  that  it  is  an  "insight"  scale  in  some 
sense.  The  persons  who  are  in  the  hos- 
pital with  Hs  scores  of  75  differ  from 
those  who  are  out  of  the  hospital  with  Hs 
scores  of  75  in  that  the  latter  have  more 
"insight"  into  their  own  behavior  and 
motivation  and  hence  avoid  the  actual 
development  of  symptoms,  even  though 
they  do  in  fact  possess  an  equal  amount 
of  "hypochondriasis-component."  This 
insight  variable  also  shows  up  in  a  will- 
ingness to  admit  to  oneself  the  sort  of 
thing  which  appears  on  the  N-scale. 

This  hypothesis  is  not  nearly  so  easy 
to  refute  as  the  hypothesis  which  posits 
Rosanoff's  positive  control  factor.  In 
fact,  a  number  of  the  findings  can  be 
brovight  to  bear  in  support  of  it.  The 
negative  correlation  with  L  and  L^  and 
with  the  18  "O"  items  of  the  Hy  scale 
would  fall  in  line  with  such  an  interpre- 
tation; for  it  has  always  been  empha- 
sized by  the  MMPI  authors  that  the  "lie" 
scale  does  not  really  mean  lie  but  a  ten- 
dency to  put  oneself  in  a  good  light, 
whether  conscious  or  unconscious.  The 
tendency  of  N  to  show  elevated  raw 
scores  in  cases  with  high  Pt  to  a  much 
greater  extent  than  in  cases  with  elevated 
neurotic  triad  might  also  be  brought  as 
evidence,  since  the  psychasthenic  is  no- 
toriously    more     "insightful"     at     least 
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verbally  than  the  hypochondriac  or  the  postulated    comes    to    he    primarily    in 

hysteric.  terms  of  the  responses  to  test  items,  and 

Opposed  to  such  an  hypothesis,  how-  not  in  terms  of  whatever  kind  of  insight 

ever,  are  nmnbers  of  bits  of  data.  First  might  protect  a  person  with  abnormal 

of  all,  on   this  assumption   there   is  no  tendencies  from  developing  the  clinical 

reasonable  account  of  the  scores  obtained  symptoms  of  a  neurosis. 

from  college  persons.  ^Ve  must  assume  The  correlation  of  N  with  Sc  and,  in 

that  the  average  college  student  or  col-  ihc  abnormals,  with  Pa  does  not  tend  to 

lege-educated    person    is   about   one   SD  sui)port    such    an    interpretation    either, 

below  the  mean  of  the  general  popula-  nor  do  any  of  the  positive  correlations 

tion  in  terms  of  how  nnuh  "insight"  he  witii    abnormal    components,   nor   those 

has  into  himself,  which  is  conceivable  to  with  G  and  "  +  ."  Furthermore,  it  should 

be  sure  but  it  is  not  supported  by  any  be  remembered  that  some  of  the  scales 

other  evidence.  In  the  second  place,  we  on  MMPI  are  "symptomatic"  themselves 

must  assume  that  if  a  patient  in  a  psycho-  and     cannot     readily     be     regarded     as 

pathic  unit  with  a  psychosis  or  psycho-  measuring  basic  components  which  ap- 

neurosis  is  not  deliberately  selected  for  pear  or  are  inhibited  from  appearing,  as 

having  a  "normal"  profde  (as  were  the  the  case  may  be.  Thus,  in  the  neurotic 

criterion  abnormals  to  some  degree),  he  triad   scales   on   which    the   N-scale   was 

has  as  much  of  this  insight  as  the  average  originally  derived,   to  get  a  high  score 

non-hospitalized  person — fully  as  much  often  simply  means  to  complain  of  those 

in  the  case  of  male  abnormals  and  only  symptoms    which,    if    actually    present, 

slightly  less  in  the  case  of  female  abnor-  would  entitle  one  to  be  called  whatever 

mals.  In  other  words,  the  insight  hypo-  diagnosis  is  written  above  the  score  on 

thesis   fails    to   account   for   the   finding  the  profile.  To  obtain  a  high  score  on 

that  abnormals  show  up  as  having  less  Hs,    for    example,    it    is    necessary    and            • 

insight  only  when  their  other  scores  are  sufficient  to  say  that  one  often  feels  weak 

restrained  from  appearing  abnormal,  but  all  over,  that  one's  hand  shakes  when  he                      •j^ 

not  when  these  other  scores  are  allowed  tries  to  do  something,   that  one's  head                     * 

to  vary  freely.  What  makes  the  difference  seems    to    hurt    all    over,    that    one    is 

here  is  the  scores,  not  the  presence  of  bothered  by  stomach  trouble  often,  that                     li 

abnormal  symptoms  themselves,  it  would  one  is  not  in  so  good  health  as  his  friends, 

appear.  If  I  am  enough  of  a  hypochon-  and  the  like.  These  statements  are  the 

driac  so  that  my  relatives  bring  me  to  an  coynplaints  which,  if  he  believes  them  and 

institution  for  psychiatric  treatment,  I  am  talks  about  them  and  acts  as  if  they  were 

likely  to  have  as  much  "insight"  as  any-  real,  define  the  hypochondriac  clinically. 

one   else   in   the  general   population   of  How  is  it  possible  to  say  that  the  "nor- 

normal  persons,  if  this  interpretation  of  mals"    who    have    sufficient    insight    as 

N  is  correct.  It  is  only  when  (in  spite  of  measured  by  N  avoid  being  hypochon- 

being  a  hypochondriac)   I  fail  to  show  driacs  in  spite  of  answering  the  items  in 

very  elevated  scores  on  the  neurotic  triad  this  way,  when  answering  the  items  in 

that  my  "insight"  may  be  expected  to  be  this  way  is  per  se  hypochondriacal?  Simi- 

poorer  than  average — and  according  to  larly,  in  the  case  of  depression,  to  say  "I 

the   findings,   not  very  much  poorer  at  cry  easily,"   "I  brood  a  great  deal,"   "I 

that.  In  this  line  of  thought,  the  insight  don't  seem  to  care  what  happens  to  me," 
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"I  am  not  happy  most  of  the  time,"  and  cases  in  which  the  personality  variable 

so  on,  would  seem  to  be  prima  facie  evi-  underlying  the  scale  response  does  not 

dence  of  actual  depression,  if  the  person  actually  exist  to  the  extent  indicated  by 

is  using  these  words  in  the  way  in  which  the  score.  In  terms  of  this  category,  the 

most  people  use  them  to  describe  their  "normals"  with  depression  scores  of  85 

psychological  condition.  are  not  out  of  the  hands  of  the  psychia- 

Lastly,    consider    the    N    items    them-  trist  because  they  have  a  lot  of  "control" 

selves.  Persons  who  receive  deviant  scores  which  holds  this  depression  component 

on  the  neurotic  triad  but  say  that  they  in  check;  nor  arc  they  outside  because 

are    easily    angered,    that    a    windstorm  they   have   failed   to   show   certain   very 

terrifies  them,  that  they  are  afraid  of  fire,  gross  signs  and  symptoms  as  part  of  their 

that  they  have  a  fear  of  water  (to  men-  depression;  nor  are  they  outside  because 

tion  some  among  the  most  potent),  have  they  have  enough  "insight"  to  keep  their 

a  much  better  chance  of  being  out  of  the  depression     from    incapacitating    them, 

hospital   than   those  w^ho  do  not  assert  They  are  most  probably  outside  because 

these    unpleasant    things    about    them-  they  are  simply  not  nearly  so  depressed 

selves.  It  seems  stretching  the  point  con-  (in  the  biophysical  sense)  as  the  people 

siderably    to   say   that   for   a   person    to  inside,  and  their  depression  score  of  85 

admit    that    a    windstorm    terrifies    him  is  spuriously  high  because  of  some  other 

shows  he  has  more  insight  than  another  personality  or  verbal  factor.  The  present 

person   who   does    not    admit   this,    and  hypothesis    is    that    it    is    this    factor   of 

that,  if  they  both  have  an  equal  amount  "plus-getting"  that  is  detected  by  the  N- 

of   hypochondriacal,    depressive,    or   hy-  scale. 

steroid  components,  the  former  is  more  In  terms  of  the  list  of  possibilities 
likely  to  remain  out  of  the  hands  of  a  given  under  "Errors  of  Measurement," 
psychiatrist.  It  seems  much  more  parsi-  categories  A-i  and  A-2  have  been  system- 
monious  to  the  present  writer  to  doubt  atically  excluded  in  the  present  study  by 
that  all  the  "normals"  who  said  "A  wind-  the  insistence  upon  very  rigorous  stand- 
storm  terrifies  me"  were  merely  mani-  ards  of  validity  in  terms  of  profile  form,  ? 
festing  more  honesty  and  insight  than  and  F  scales.  The  only  other  possibilities 
the  abnormals  who  were  equally  terri-  are  "Unusual  interpretation  of  the  mean- 
fied;  and  I  suspect  that  neither  group  was  ing  of  the  questions  of  a  sort  other  than 
especially  terrified  but  that  those  who  those  interpretations  related  to  the  per- 
said  that  they  were  terrified  were  not  sonality  trait  in  question,"  and  "The 
actually  so  at  all.  This  interpretation  patient  does  not  see  the  facts  as  they  are." 
brings  us  to  the  last  hypothesis,  which  It  has  been  pointed  out  previously  that 
will  be  defended  as  being  somewhat  more  unusual  interpretation  as  such,  and  not 
plausible  than  the  others.  seeing  the  facts  as  they  are  as  such,  do 

In  terms  of  the  list  of  possible  ways  in  not  necessarily  give  rise  to  test  misses, 

which  a  "test  miss"  can  occur  (Chapter  because  of  the  deliberately  projective  ele- 

II),  it  seems  to  the  writer  that  the  most  ment  in  the  whole  theory  of  the  MMPI. 

reasonable  place  to  assign  the  source  of  But  there  may  be  some  personality  traits, 

N's  variance  and  discriminating  power  as  well  as  some  quite  superficial  verbal 

is  under  Group  I,  the  "Errors  of  Meas-  patterns,  which  are  almost  unrelated  to 

urement"    category.    This    includes    the  the  variable  with  which  a  given  item  is 
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loaded  and  for  the  discrimination  of 
which  it  was  empirically  selected.  The 
traits  and  patterns  are  the  "irrelevant" 
components  of  the  predictor  as  described 
by  Horst  (16).  and  the  hypothesis  I  wish 
to  consider  seriously  is  that  the  N  scale 
acts  as  a  suppression  variable  for  some 
of  these  irrelevant  components. 

The  detailed  character  of  these  ten- 
dencies I  am,  of  course,  unprepared  to 
elucidate.  The  crudest  description  would 
be  to  say  merely  that  they  involve  the 
tendency  to  get  high  scores  on  personality 
tests  of  the  MMPI  variety,  which  is  say- 
ing little  more  than  the  empirical  find- 
ings except  that  it  points  the  way  to  an 
interpretation  in  terms  of  test-taking  as 
such  instead  of  one  in  terms  of  psychiatric 
personality  components  and  dynamisms 
as  did  the  other  hypotheses.  There  have 
been  obser\ed  by  Professor  Hathaway 
and  the  writer  a  number  of  persons  who 
were  excessively  preoccupied  with  their 
own  psychiatric  status,  who  were  rumi- 
native and  anxious  about  their  own  state 
of  mind  and  their  own  feelings,  who  dis- 
played the  "psychasthenoid  tempera- 
ment" in  a  modified  and  watered  form 
without  being  really  psychasthenic  in  the 
sense  of  interfering  compulsions  and  ob- 
sessions. Many  of  these  persons  are  per- 
petual "plusgetters"  on  personality  tests, 
and  in  fact  an  interest  in  them  was,  as 
has  been  pointed  out  before,  the  starting 
point  for  the  present  study.  When  the 
present  investigation  began,  the  Rosan- 
off-Humm-Wadsworth  hypothesis  had 
structured  the  field  so  that  the  failure  of 
these  high-scoring  "normals"  to  be  in  a 
mental  hospital  or  at  least  incapacitated 
for  their  normal  functions  was  envisaged 
in  terms  of  Category  II  instead  of  Cate- 
gory I.  On  the  present  evidence  it  seems 
more  fruitful  to  conceive  of  these  persons 
as   simply    mismeasured   cases,    and    the 


practical  problem  as  one  of  improving 
the  instrument  so  as  to  suppress  the 
verbal-personaliiy  components  that  lead 
to  the  mismeasurcmcnt. 

It  is  not  denied  in  the  present  hypo- 
thesis that  "personality"  factors  are  in- 
vohed  in  these  components,  whatever 
they  may  turn  out  to  be.  That  they  are 
allied  to  whatever  the  psychasthenia  scale 
measures  is  clear  from  the  correlation 
with  Ft,  as  is  also  the  case  for  Sc.  But  the 
difference  between  this  and  Category 
II-B-i  ("other  traits  inhibit  the  trait  in 
question")  is  that  the  other  traits  in- 
volved do  not  inhibit  the  measured  trait 
but  rather  prevent  it  from  being  accur- 
ately measured.  Thus,  a  high  N  score 
does  not  indicate  that  the  personality 
component  measured  by  N  or  Ft  is  acting 
to  "restrain"  a  man's  depression  com- 
ponent— an  interpretation  that  is  far 
fetched  to  begin  with  in  the  light  of  what 
psychasthenia  is  like  subjectively — but 
rather  that  a  man's  depression  compo- 
nent as  well  as  his  psychasthenia  com- 
ponent are  not  so  strong  as  the  scores  on 
D  and  Ft  would  indicate,  because  he  has 
an  excessive  amount  of  whatever  it  is 
that  is  measured  by  N  and  produces  high 
scores  on  psychiatric  scales. 

The  extent  to  which  N  involves  factors 
closely  allied  to  psychasthenia  proper  as 
contrasted  with  more  superficial  and 
semantic  trends  leading  to  high  scores 
on  Ft  and  Sc  alike  will  not  even  be  esti- 
mated here  because  there  is  no  evidence 
on  the  point.  The  correlation  with  Sc 
remains  mysterious  except  for  the  minor 
point  that  there  is  an  element  of  semantic 
confusion  and  unrealistic  thinking,  which 
is  common  to  these  pseudo-psychasthenic 
persons  and  to  schizoid  persons  alike, 
but  which  in  the  latter  cases  assiunes  a 
much  more  malignant  form.  "I  wish  I 
could  be  as  happy  as  other  people  seem 
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to  be"  may  represent  the  distress  of  an  does  not  have  much  of  this  plus-getting 

acutely  miserable  person  with  a  genuine  tendency,   the  "plus"  responses  he  gave 

depression.  But  it  may  also  represent  the  are  likely  to  indicate  a  genuine  psychi- 

superficial  comjilaint  of  nmiinative,  un-  atric  deviation.   On   the   other   hand,   a 

realistic,  semantically  dissociated  person  high  raw  score  in  an  abnormal  person  is 

who,  to  be  sure  may  be  feeling  blue  but  not  difficult  to  understand,  since  there 

has  no  real  conception  of  what  it  means  is  in  this  interpretation  no  reason  to  deny 

to  be  "depressed"  as  is  a  manic-dejjres-  a  lot  of  the  plus-getting  tendency  to  an 

sive.  It  is  something  akin  to  this  kind  of  abnormal   person   any   more   than    to   a 

characteristic  that  I  have  in  mind  when  normal  one. 

I  use  the  phrase  "plus-getting"  as  de-  It  must  be  admitted  that  the  present 
scriptive  of  N.  hypothesis  is  hardly  detailed  enough  to 
The  high  positive  correlation  of  N  be  theoretically  satisfying,  but  it  would 
with  Pt  and  Sc,  the  high  negative  cor-  be  illegitimate  to  do  more  than  hint  at 
relation  of  N  with  Lg,  the  high  positive  its  nature  on  such  flimsy  evidence  as  now 
correlation  with  Q,,  which  is  simply  a  exists.  To  design  experiments  having  this 
"correction  scale"  (suppression  variable)  hypothesis  in  mind  is  difficult,  for  it  is 
for  hypochondriasis  introduced  because  not  possible  to  use  as  criteria  of  "genu- 
the  old  H  scale  was  often  elevated  in  ine"  psychiatric  deviation  any  of  the  in- 
psychiatric  patients  free  of  hypochon-  dicators — responses  to  test  items,  self- 
driasis,  the  high  negative  correlation  of  descriptions  in  interviews — which  are 
N  with  the  "psychiatric  denial"  items  of  presumably  greatly  affected  by  the  factor 
the  Hy  scale,  all  of  these  findings  are  hypothesized.  One  possible  approach 
readily  understood  in  the  light  of  the  would  be  an  objective  (say,  multiple- 
hypothesis  that  N  measures  "plus-get-  choice)  test  based  upon  those  items  on 
ting"  tendencies.  That  such  a  trait  N  which  are  answered  in  the  scored 
should  show  up  correlated  positively  direction,  in  an  effort  to  find  out  whether 
with  almost  all  of  the  MMPI  scales  for  the  differences  in  interpretation  could 
abnormal  components  is  to  be  expected,  be  elicited  through  verbal  means  alone. 
That  it  should  differentiate  normals  with  Thus,  to  say  "I  am  afraid  of  fire"  may  be 
deviant  profiles  from  abnormals  with  a  sign  of  a  phobic  reaction  in  a  clinically 
deviant  profiles  is  understandable  since  compidsive-obsessive  patient.  But  this 
it  means  that  the  normals  are  not  deviant  response  also  scores  a  point  on  N,  in 
as  their  profiles  suggest.  On  the  other  which  case  it  may  merely  reflect  a  differ- 
hand,  that  normals  in  general  should  ence  in  interpretation  on  an  almost 
not  differ  much  from  abnormals  in  gen-  purely  semantic  basis.  There  is,  for  ex- 
eral  (unless  the  latter  achieve  normal  ample,  an  "obvious"  sense  in  which 
profiles)  is  also  imderstandable,  since  the  everyone  is  afraid  of  fire,  in  that  everyone 
N  scale  is  not  measuring  a  personality  will  have  anxiety  if  trapped  by  fire, 
trait  of  intrinsic  psychiatric  significance  everyone  will  avoid  putting  his  hand  in 
but  one  which  shows  up  as  significant  the  flames,  and  so  on.  Yet  the  item  fre- 
only  because  a  test  has  been  taken.  The  quencies  for  the  normal  population  show 
fact  that  a  low  raw  score  on  N  with  a  that  to  interpret  the  item  in  this  "ob- 
deviant  profile  indicates  clinical  abnor-  vious"  way  is  a  rare  thing.  The  majority 
mality   means   merely   that   if  a   person  of  "normal"  persons  do  not  say  they  are 
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afraid  of  fire,  presumably  because  they  thesis    has    the    merit    that,    unlike    the 

interpret  the  question  in  a  different  way.  others,   it  does  not  come  into  any  real 

A  criterion  "normal"  who  says  "I  often  conflict  with  this  finding,  although  it  can 

dream  about  things   that  are  best  kept  certainly  not  be  said  to  predict  it  as  a 

to  myself"  gets  scored  a   point  for  de-  logical    consequence.    Whether    college 

prcssion.  If  he  says  many  things  of  this  merely  selects  persons  of  sufficiently  high 

sort,  he  gets  a  high  T  score  on  depression.  verbal  intelligence  to  minimize  the  prob- 

The  unusual  semantics,  sclf-dcprecaiion,  abilities  of  such  semantic  distortions  as 

verbal  pessimism,  or  "psychiatric  hypo-  have  been  suggested  remains  to  be  veri- 

chondriasis"  Avhich  leads  him  to  respond  fied  or  refuted  by  further  experimenta- 

in  this  way  may  also,  however,  lead  him  tion.  But  that  college  educated  persons 

to  say  "I  am  afraid  of  fire"  and  by  so  should    have    less    of    a    tendency    than 

doing  raise  his  score  on  N.  How  often  people  generally  to  interpret  personality 

does    "often"    mean    when    he    says    he  test  items  in  such  a  way  as  to  put  them- 

often  dreams  about  things  best  kept  to  selves  in  a  "bad"  light  psychiatrically— 

himself.^   To   remove    the    ambiguity   of  possibly  also  as  a  result  of  greater  verbal 

that  "often"  would  lower  the  discrimina-  test  sophistication — is  at  least  a  good  a 

tion  of  many  MMPI  items,  by  reducing  priori  possibility  compared  with  the  im- 

them    to    mere    self-rating   devices.    But  plications  of  the  alternative  hypotheses 

when    we    leave    the   ambiguity    in,    we  regarding  N. 

leave  it  in  at  a  price,  due  to  the  existence  In  conclusion  it  must  be  admitted  that 
of  other  variables  besides  those  we  are  the  psychological  nature  of  the  chief  fac- 
testing  by  that  item.  If  N  is  loaded  also  tors  contributing  to  the  N  score  remains 
by  some  of  these  other  variables,  and  not  essentially  undetermined  on  the  present 
heavily  loaded  with  depression  itself,  evidence.  The  hypothesis  tentatively  sup- 
then  we  can  try  to  suppress  the  non-  ported  here  leads  to  interesting  and  im- 
depression  components  of  the  D-scale  by  portant  possibilities  for  investigation  of 
the  use  of  N.  An  intensive  study  of  devi-  the  whole  problem  of  improving  ques- 
ant-scoring  "normals"  with  high  and  low  tion-answer  type  personality  inventories. 
raw  scores  on  N  using  a  multiple-choice  As  was  suggested  in  Chapter  II,  it  is  not 
method  might  shed  some  light  on  this  inconceivable  that  the  future  develop- 
problem.  When  a  subject  says  "I  am  ment  of  this  type  of  personality  measur- 
afraid  of  fire,"  he  could  indicate  later  ing  device  will  be  aided  in  large  part 
whether  he  interpreted  the  item  to  mean  by  the  systematic  analysis  of  personality- 
"Thinking  of  fire  gives  me  anxiety,"  "I  test-taking  as  a  behavioral  process.  Such 
become  rather  nervous  when  near  a  large  study  will  certainly  increase  our  ability 
fire,"  "I  would  probably  be  frightened  to  construct  more  valid  personality  tests 
if  caught  in  a  burning  house,"  "It  is  a  over  what  would  be  possible  with  the 
fact  that  fire  can  be  dangerous  to  a  per-  more  traditional  approach,  which  con- 
son,"  and  so  forth.  ceives  of  such  tests  as  simple  self-rating 
Of  the  various  facts  collected  so  far  scales  in  which  "error"  is  accepted  as 
regarding  the  N-scale,  the  only  one  inevitable  because  of  the  basic  untrust- 
which  does  not  actively  support  the  worthiness  of  human  ratings  and  particu- 
present  interpretation  is  the  deviation  of  larly  those  ratings  demanding  "objec- 
college  groups.  At  least  the  present  hypo-  tive"  judgments  concerning  the  self. 


Chapter  VII 
Summary  and  Conclusions 


T 


HE  ORIGINAL  aim  of  the  present  in- 
vestigation was  the  isolation  of  a 
scale  of  items  on  the  Minnesota  Multi- 
phasic Personality  Inventory  which 
would  quantify  the  hypothetical  factor  of 
general  "normality"  or  "control"  as  de- 
scribed by  Rosanoff  and  purportedly 
measured  previously  in  the  Humm-Wads- 
worth  Temperament  Scale.  The  proce- 
dure was  one  of  empirically  isolating 
items  from  the  total  item  pool  of  the 
MMPI  on  the  basis  of  their  discrimina- 
tion between  a  criterion  group  of  ap- 
parently normal  persons  showing  marked 
deviations  on  the  three  scales  of  the 
"neurotic  triad,"  and  a  matched  group 
of  clinically  abnormal  persons  with  simi- 
lar profiles.  The  scale  of  78  items  result- 
ing from  this  analysis  was  called  N  for 
"normal"  although  this  term  here  indi- 
cates its  differentiating  power  only  and 
does  not  imply  any  particular  dynamic 
interpretation  of  its  operation. 

The  N  scale  was  found  to  have  a  fairly 
satisfactory  odd-even  and  test-retest  reli- 
ability, and  was  shown  to  continue  dis- 
crimination when  applied  to  a  new 
"test"  group  of  deviant-scoring  normals. 
There  was  also  evidence  of  some  degree 
of  generality  in  that  it  showed  statistically 
significant  differentiation  between  "test" 
normals  and  abnormals  deviating  on 
other  components  than  the  neurotic  triad 
on  which  the  derivation  was  based.  This 
latter  discrimination,  however,  was  not 
nearly  so  good  in  terms  of  overlap  and 
could  hardly  be  of  any  practical  utility. 
The  overall  finding  from  the  study  of 
the  neurotic  triad  seemed  to  be  that  the 
typical  deviant-scoring  normal  scores  in 
the  neighborhood  of  one  standard  devi- 
ation (raw  score)  above  the  mean  of  the 


general  population,  and  that  the  lower 
such  a  raw  score  becomes,  the  greater 
are  the  probabilities  that  the  person  is 
abnormal  clinically.  If  a  person  shows  a 
deviant  profile  and  his  N-score  is  as  small 
as  one  standard  deviation  below  the  nor- 
mal population  mean,  he  is  practically 
certain  to  be  psychiatrically  involved  to 
the  extent  of  actually  being  under  psy- 
chiatric care.  Exceptions  to  this  rule  are 
almost  completely  confined  to  persons 
who  are  in  college  or  have  been  gradu- 
ated from  college;  this  group  deviates  on 
the  average  a  full  standard  deviation 
from  the  mean  of  the  general  popula- 
tion. It  was  suggested  that  further  in- 
vestigation may  warrant  the  construction 
of  special  norms  for  the  college  group. 

It  was  shown  that  unselected  abnor- 
mals tend  to  score  about  the  same  as  the 
general  population,  with  a  slight  ten- 
dency in  the  case  of  female  abnormals  to 
show  lower  raw  scores  than  normals.  In 
the  case  of  abnormals,  there  was  evidence 
that  N  functions  as  an  "inverted  lie" 
scale  to  some  extent,  in  that  hospitalized 
abnormals  of  heterogeneous  diagnosis 
but  with  normal-appearing  MMPI  pro- 
files tended  to  show  low  raw  scores  on  N. 
A  minor  experiment  in  "blind"  diag- 
nosis indicated  that  the  use  of  N  by  an 
unskilled  person  in  a  purely  mechanical 
way  would  enable  him  to  differentiate 
the  profiles  of  normal  from  those  of  ab- 
normal persons  as  well  as  or  better  than 
could  be  done  by  the  judgment  of  per- 
sons highly  familiar  with  MMPI  and 
skilled  in  the  interpretation  of  profiles. 

It  was  discovered  that  N  was  correlated 
in  varying  degrees  with  the  other  scales 
of  MMPI,  in  almost  all  cases  significantly 
positive.  The  highest  correlations  were 
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with  the  scales  for  psychasthenia  and 
schizophrenia  among  the  personality 
components  proper,  and  with  a  discarded 
correction  scale  for  the  hypochondriasis 
component.  Negati\e  correlations  were 
foiuid  with  scales  indicating  a  tendency 
to  put  oneself  in  a  favorable  light,  ami 
with  a  sub-set  of  items  from  the  iiysteria 
scale  measming  a  tendency  to  deny  that 
one  has  social  or  psychiatric  maladjust- 
ments. There  was  also  fotmd  a  positive 
correlation  of  N  with  chronological  age. 

Study  of  the  items  themselves  shows 
that  they  are  in  the  very  great  majority 
of  instances  answered  by  the  "normals" 
who  have  abnormal  appearing  profiles 
in  the  direction  which  is  statistically  ab- 
normal in  the  unselected  general  popula- 
tion sample,  and  ^\'hich  furthermore  is 
the  direction  that  ordinary  judgment 
would  consider  the  unhealthy,  unhappy, 
or  psychiatrically  imdesirable  direction. 
Those  items  on  N  which  are  common  to 
both  it  and  the  "normal"  component  of 
the  Humm-Wadsworth  are  scored  in  the 
opposite  direction  in  every  case  on  the 
two  keys.  This  paradoxical  finding  com- 
bined with  differential  and  correctional 
data  leads  to  a  definite  rejection  of  any 
interpretation  of  N  in  terms  of  Rosan- 
off's  active  "normalizing"  component  as 
originally  described. 

Hypotheses  as  to  the  psychological 
nature  of  N  are  considered  briefly.  The 
interpretation  of  N  as  being  simply  a 
denial  of  the  most  severe  and  pathog- 
nomonic symptoms  is  immediately  re- 
jected because  of  its  failure  to  distinguish 
normals  from  abnormals  generally  to 
any  great  extent,  combined  wdth  the 
actual  content  of  the  items.  An  interpre- 
tation of  N  in  terms  of  the  greater  "in- 
sight" shown  by  the  deviant-scoring  nor- 
mals in  admitting  to  the  failings  scored 
on  N  is  admitted  as  more  plausible  but 


finally  rejected  because  of  the  college 
findings,  the  correlation  with  other 
scales,  the  lack  of  good  differentiation 
between  normals  and  abnormals  in  gen- 
eral, and  the  a(  tual  content  of  the  most 
potent  items. 

The  provisional  hy})othesis  offered  is 
that  the  N  scale  is  loaded  with  a  "plus- 
getting"  tendency  as  regards  personality 
tests,  possibly  allied  to  a  kind  of  spurious 
jjsychasthenia  shown  by  certain  persons 
who  often  achieve  extremely  pathologi- 
cal scores  on  such  tests  as  MM  PI  with  a 
minimmn  of  genuine  suffering  and  com- 
plaint. The  relative  contribution  of  ])er- 
sonality  factors  versus  non-personological 
\erbal  and  semantic  factors  is  unknown 
at  present.  It  is  suggested  that  the  mark- 
edly deviant  neurotic  scores  shown  by 
those  normals  who  also  have  high  raw 
scores  on  N  reflect  a  distortion  of  re- 
sponse attributable  to  a  component  the 
influence  of  which  must  be  artificially 
"suppressed"  by  the  use  of  scales  such  as 
N  in  order  to  reduce  test  misses.  In  other 
words,  abnormal  profiles  may  arise  from 
a  genuine  abnormality  as  indicated  by 
the  scale;  but  they  may  also  arise  from 
"deviations"  which,  while  possibly  re- 
lated to  various  aspects  of  personality, 
are  not  the  deviations  for  which  the 
scale  was  built.  To  correct  or  partial  out 
the  influence  of  these  other  components 
requires  that  we  first  obtain  measures  of 
them,  and  the  present  N-scale  is  tenta- 
tively offered  as  an  example  of  such 
measurement.  It  is  suggested  that  a  direct 
attack  upon  this  hypothesis  concerning 
N  might  be  made  via  the  use  of  multiple- 
choice  situations  applied  to  those  persons 
who  do  deviate  in  regard  to  their  N- 
scores,  in  order  to  find  out  whether  the 
semantic-verbal  distortions  hypothesized 
really  occur. 

Finally  even  though  the  differentiating 
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powers  of  the  present  N-scale  are  not 
nearly  so  gieat  as  could  be  desired  for 
practical  reasons,  the  mere  fact  that  sig- 
nificant differentiation  exists  with  the 
items  being  of  such  a  nature  and  scored 
in  such  a  direction  as  they  are,  indicates 
that  a  pursuit  of  this  lead  in  the  field  of 
personality  testing  may  be  well  rewarded. 


Once  we  have  progressed  to  the  point  of 
sophistication  which  abandons  the  ques- 
tionnaire as  a  simple  self-rating  situation 
and  infers  the  significance  of  responses 
from  the  groups  differentiated  rather 
than  the  converse,  new  and  important 
advances  may  be  expected  with  confi- 
dence. 
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